VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC THERAPY

Work described herein was supported by NIH grants EY11431 and
HL54352. The government has certain rights in such subject matter. 

RELATED APPLICATIONS

This application claims the benefit of priority to U.S. application Serial No. 
09/562,934, filed May 1, 2000, to Glen R. Nemerow, Daniel Von Seggern, and
Martin Friedlander, entitled "VECTORS FOR OCULAR TRANSDUCTION AND USE
THEREFOR FOR GENETIC THERAPY". 

This application is related to copending U.S. application Serial
No. 09/482,682 (also filed as International PCT application No.
PCT/US00/00265, filed January 14, 2000), to Daniel Von Seggern, Glen R.
Nemerow, Paul Hallenbeck, Susan Stevenson, Yelena Skripchenko, filed January
14, 2000, entitled "Adenovirus Vectors, Packaging Cell Lines, Compositions, and
Methods for Preparation and Use," which is a continuation-in-part of U.S.
Application 09/423,783 filed November 12, 1999 and claims the benefit of the
filing date of U.S. Provisional Application 60/115,920 filed January 14, 1999.
Where permitted, the contents and subject matter of each application and of the 
provisional application are incorporated in their entirety herein by reference. 

FIELD OF INVENTION

The present invention relates to gene therapy, especially to adenovirus
vector-based gene therapy. In particular, adenovirus vectors for therapy of
ocular diseases and methods of treatment using the vectors are provided.
Compositions, kits, and methods of preparation and use of the vectors for gene
therapy are provided.

BACKGROUND OF THE INVENTION 

Retinal dystrophies 

The eye is susceptible to a number of hereditary and/or age-related
degenerative disorders. In the United States, common causes of irreversible
blindness or severe loss of vision are retinal dystrophies (see, e.g., Cotlier et al.
(1995) Surv. Ophthalmology 40:51-61; Bird (1995) Am. J. Ophthal. 119:
543-562; and Adler (1996) Arch Ophthal 114:79-83). The retina is the sensory
tunic of the eye, containing light sensitive receptors, a complex of neurons, and
pigmented epithelium, arranged in discrete layers. In humans, the macula is the
portion of the retina that lies directly behind the lens. Cones, the photoreceptor
cells responsible for central vision, are heavily concentrated in the macula.
Central dystrophies, which affect the macula, include Best's disease, age-related
macular degeneration, and Stargardt's macular dystrophy. The peripheral retina
is composed mainly of rods, which are responsible for side and night vision.
Peripheral degenerative retinal diseases include retinitis pigmentosa, choroideremia
and Bietti's crystalline dystrophy.

Macular degenerations are a heterogeneous group of diseases,
characterized by progressive central vision loss and degeneration of the macula
and underlying retinal pigmented epithelium. Age-related macular degeneration
(ARMD) is the most common form of the disease, affecting an estimated 20% of
persons over 75 years of age. ARMD is poorly understood in terms of etiology
and pathogenesis. The very late onset of the disease has made genetic mapping
particularly difficult. Certain macular degenerative conditions with a clear genetic
basis, such as Stargardt's and Best's diseases, share many features with ARMD,
but have been more amenable to molecular and genetic analysis.

Hereditary peripheral retinopathies are also relatively common. Retinitis
pigmentosa (RP), for example, affects approximately 1.5 million people
worldwide. Substantial genetic heterogeneity has been observed in this
condition, with over 20 chromosomal loci identified. A predisposition to retinitis
pigmentosa can be inherited by autosomal dominant, autosomal recessive,
X-linked or digenic mode. Mutations have been identified in seven genes, four of
which encode proteins in the rod phototransduction cascade: rhodopsin, alpha
and beta subunits of rod cGMP phosphodiesterase, and rod cGMP cation-gated
channel protein alpha subunit. Mutations in the peripherin/RDS gene have been
linked to retinitis pigmentosa and macular degeneration. A single peripherin/RDS
mutation apparently caused retinitis pigmentosa, pattern dystrophy and fundus
flavimaculatus in different family members.



In spite of causal heterogeneity, there is significant clinical similarity
among RP subtypes. Common signs and symptoms include early
electroretinographic abnormalities, ophthalmoscopic findings, and
protracted, contiguous expansion of the ring-like scotoma toward the macula,
leading to progressively worsening tunnel vision. A recent hypothesis is that
active photoreceptor cell death, which is characteristic of these genetically
distinct disorders, is mediated by a common induction of apoptosis. It may be
possible to treat these conditions by the administration of agents that block
induction of apoptosis in photoreceptors, such as neurotrophic factors.

Adenovirus delivery vectors

Adenovirus, which is a DNA virus with a 36 kilobase (kb) genome, is very
well-characterized and its genetics and genetic organization are understood.
The genetic organization of adenoviruses permits substitution of large fragments
of viral DNA with foreign DNA. In addition, recombinant adenoviruses are
structurally stable and no rearranged viruses are observed after extensive
amplification.

Adenoviruses have been employed as delivery vehicles for introducing
desired genes into eukaryotic cells. The adenovirus delivers such genes to
eukaryotic cells by binding to cellular receptors followed by internalization. The
adenovirus fiber protein is responsible for binding to cells. The fiber protein has
two domains, a rod-like shaft portion and a globular head portion that contains
the receptor binding region. The fiber spike is a homotrimer, and there are 12
spikes per virion. Human adenoviruses bind to and infect a broad range of
cultured cell lines and primary tissues from different species.

The 35,000+ base pair (bp) genome of adenovirus type 2 has been
sequenced and the predicted amino acid sequences of the major coat proteins
(hexon, fiber and penton base) have been described (see, e.g., Neumann et al.,
Gene 69: 153-157 (1988); Herisse et al., Nuc. Acids Res. 9: 4023-4041 (1981);
Roberts et al., J. Biol. Chem. 259: 13968-13975 (1984); Kinloch et al., J. Biol.
Chem. 259: 6431-6436 (1984); and Chroboczek et al., Virol. 161: 549-554,
1987). The 35,935 bp sequence of Ad5 DNA is also known and portions of
many other adenovirus genomes have been sequenced. The upper packaging
limit for adenovirus virions is about 105% of the wild-type genome length (see,
e.g., Bett et al., J. Virol. 67(10): 5911-21, 1993). Thus, for Ad2 and Ad5, this
would be an upper packaging limit of about 38 kb of DNA.
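The packaging-limit arithmetic can be checked with a minimal sketch. It assumes the 35,935 bp Ad5 genome length quoted above and treats "about 105%" as an exact multiplier; the function name and constant are illustrative only and do not come from the cited references.

```python
# Illustrative only: the 1.05 factor stands in for "about 105%" of wild-type length.
AD5_GENOME_BP = 35935  # Ad5 genome length as quoted in the text


def packaging_limit_bp(genome_bp, fraction=1.05):
    """Return the approximate upper packaging limit in base pairs."""
    return round(genome_bp * fraction)


limit_bp = packaging_limit_bp(AD5_GENOME_BP)
# Headroom for foreign DNA when nothing is deleted from the wild-type genome.
headroom_bp = limit_bp - AD5_GENOME_BP
print(f"limit ~ {limit_bp / 1000:.1f} kb, headroom ~ {headroom_bp} bp")
```

Under these assumptions the limit comes to a little under 38 kb, which is consistent with the rounded figure given in the text.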

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs)
ranging in size from about 100 to 150 bp, depending on the serotype. The
inverted repeats permit single strands of viral DNA to circularize by base-pairing
of their terminal sequences to form base-paired "panhandle" structures that are
required for replication of the viral DNA.

The ITRs and the packaging signal (a few hundred bp in length) together
comprise the "minimum requirement" for replication and efficient packaging of a
genomic nucleic acid into an adenovirus particle. Helper-dependent vectors
lacking all viral ORFs but including these essential cis elements (the ITRs and
contiguous packaging sequence) have been constructed.

Ad vectors have several distinct advantages as gene delivery vehicles.
For example, recombination of such vectors is rare; there are no known
associations of human malignancies with adenoviral infections despite common
human infection with adenoviruses; the genome may be manipulated to
accommodate foreign genes of a fairly substantial size; and host proliferation is
not required for expression of adenoviral proteins. Adenovirus (Ad)-based gene
delivery vectors efficiently infect many different cells and tissues. This broad
tropism, however, means that gene delivery cannot be directed to a specific
target cell. A large fraction of intravenously administered adenovirus is
retained by the liver, which could lead to undesirable side-effects. Adenovirus
may potentiate immune responses. For example, adenovirus type 5 (Ad5) also
transduces dendritic cells, which present antigens very efficiently, thereby
possibly exacerbating the immune response against the vector. It has been
proposed that vectors with different targeting efficiencies might eliminate these
problems, permitting a lower total particle dose and more specific targeting (see,
e.g., U.S. application Serial No. 09/482,682).
The wealth of information on adenovirus structure and mechanism of
infection, its efficient infection of nondividing cells, and its large genetic capacity
make adenovirus a popular gene therapy vector. The wide expression of
receptors to which adenovirus binds makes targeting adenovirus vectors
difficult.

Hence there is a need to improve delivery and targeting of adenoviral
vectors and also to provide treatments for ocular disorders. Therefore, it is an
object herein to provide adenoviral vectors that specifically or selectively target
cells in the eye. It is also an object herein to provide these vectors for treatment
of ocular disorders.

SUMMARY OF THE INVENTION 

Degenerative ocular diseases, such as, but not limited to, retinitis
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization,
and others (see, e.g., Table below), have a genetic basis. Genes expressed in
the photoreceptor cells at the back of the retina are implicated in these diseases.
Provided herein are recombinant viral vectors for targeting therapeutic products
to these cells.

Recombinant adenoviral vectors that include nucleic acid that permits
specific binding to these photoreceptors are provided. In particular, the vector
particles contain a fiber protein of Ad37 or a modified form thereof. As shown
herein, fiber protein from Ad37 permits efficient infection of photoreceptor cells.
Fiber proteins from other adenovirus D serotypes may also be used. In addition,
the portions of the fiber protein, particularly those that interact with other viral
structural proteins, such as penton, may be modified to resemble the viral source
of the other structural proteins. As exemplified herein, the recombinant viruses
provided herein include Ad5 structural components. The N-terminus of the Ad37
fiber protein, which interacts with the penton protein, is modified to resemble
the Ad5 fiber protein N-terminus to ensure production of viral particles.

The recombinant adenoviral vectors are intended for gene therapy of
diseases in which genes expressed in the photoreceptors are implicated. Such
diseases include, but are not limited to, degenerative ocular diseases, such as
retinitis pigmentosa and Stargardt's disease. These vectors are also useful for
targeting to other ocular cells, such as conjunctival cells, which also bear
receptors to which fiber from Ad37 and related serotypes bind.

The vectors will deliver therapeutic agents to the targeted cells for
treatment of a variety of disorders (see, e.g., Tables 3 and 4, below). The
therapeutic agents are intended for expression in the photoreceptors and for
secretion from the photoreceptor cells, which are surrounded on one side by
choroidal vasculature, and on the other side by retinal vasculature, thereby
providing a means for delivery of products. In addition, expression of growth
factors, such as VEGF and others, can be used to enhance blood flow to the
retina and prevent or slow the degeneration.

Therapeutic agents encoded by the recombinant adenoviral vectors
include, but are not limited to, nucleic acid molecules encoding
genes that are defective in certain hereditary disorders; nucleic acid molecules
that encode antiangiogenics and antitumor agents for treatment of retinal
disorders, such as retinoblastomas; nucleic acid molecules encoding trophic
factors, such as glial cell line-derived neurotrophic factor (GDNF) and ciliary
neurotrophic factor (CNTF), growth factors and growth factor inhibitors,
antiapoptotic factors, such as Bcl-2 (CNTF), antitumor agents, anti-angiogenics,
and genes or portions thereof for gene replacement or repair of defective genes.
Hence, methods for treatment of inherited and acquired retinal diseases,
including diseases involving neovascular and vascular degeneration, are provided.

Methods for treating diseases involving genes expressed in photoreceptor
cells are provided herein. The methods provided herein are practiced by
administration of the recombinant viral vectors by any means suitable for
delivery to the photoreceptors. A preferred mode of administration is intraocular
injection including intravitreal and subretinal injection. Other modes of
administration include, but are not limited to, intrascleral, periorbital and
intravenous administration. The vectors also can include photoreceptor-specific
promoters thereby providing a means, not only for specific targeting of
expression in these cells, but also for photoreceptor-restricted transgene
expression.
DETAILED DESCRIPTION OF THE INVENTION

A. DEFINITIONS

Unless defined otherwise, all technical and scientific terms used herein
have the same meaning as is commonly understood by one of skill in the art to
which this invention belongs. All patents, applications, published applications
and other publications and sequences from GenBank and other data bases
referred to anywhere in the disclosure herein are incorporated by reference in
their entirety.

As used herein, the amino acids, which occur in the various amino acid
sequences appearing herein, are identified according to their three-letter or
one-letter abbreviations. The nucleotides, which occur in the various DNA
fragments, are designated with the standard single-letter designations used
routinely in the art (see Table 1).

As used herein, amino acid residue refers to an amino acid formed upon
chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The
amino acid residues described herein are preferably in the "L" isomeric form.
However, residues in the "D" isomeric form can be substituted for any L-amino
acid residue, as long as the desired functional property is retained by the
polypeptide. NH2 refers to the free amino group present at the amino terminus
of a polypeptide. COOH refers to the free carboxy group present at the carboxyl
terminus of a polypeptide. In keeping with standard polypeptide nomenclature
described in J. Biol. Chem., 243: 3552-59 (1969) and adopted at 37 C.F.R. §§
1.821-1.822, abbreviations for amino acid residues are shown in the following
Table:

Table 1
Table of Correspondence

SYMBOL
1-Letter    3-Letter    AMINO ACID
Y           Tyr         tyrosine
G           Gly         glycine
F           Phe         phenylalanine
M           Met         methionine
A           Ala         alanine
S           Ser         serine
I           Ile         isoleucine
L           Leu         leucine
T           Thr         threonine
V           Val         valine
P           Pro         proline
K           Lys         lysine
H           His         histidine
Q           Gln         glutamine
E           Glu         glutamic acid
Z           Glx         Glu and/or Gln
W           Trp         tryptophan
R           Arg         arginine
D           Asp         aspartic acid
N           Asn         asparagine
B           Asx         Asn and/or Asp
C           Cys         cysteine
X           Xaa         Unknown or other

It should be noted that all amino acid residue sequences represented
herein by formulae have a left to right orientation in the conventional direction of
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid
residue" is broadly defined to include the amino acids listed in the Table of
Correspondence and modified and unusual amino acids, such as those referred
to in 37 C.F.R. §§ 1.821-1.822, and incorporated herein by reference.
Furthermore, it should be noted that a dash at the beginning or end of an amino
acid residue sequence indicates a peptide bond to a further sequence of one or
more amino acid residues or to an amino-terminal group such as NH2 or to a
carboxyl-terminal group such as COOH.

In a peptide or protein, suitable conservative substitutions of amino acids
are known to those of skill in this art and may be made generally without altering
the biological activity of the resulting molecule. Those of skill in this art
recognize that, in general, single amino acid substitutions in non-essential
regions of a polypeptide do not substantially alter biological activity (see, e.g.,
Watson et al., Molecular Biology of the Gene, 4th Edition, 1987, The
Benjamin/Cummings Pub. Co., p. 224).

Such substitutions are preferably made in accordance with those set forth
in TABLE 2 as follows:


TABLE 2

Original residue    Conservative substitution
Ala (A)             Gly; Ser
Arg (R)             Lys
Asn (N)             Gln; His
Cys (C)             Ser
Gln (Q)             Asn
Glu (E)             Asp
Gly (G)             Ala; Pro
His (H)             Asn; Gln
Ile (I)             Leu; Val
Leu (L)             Ile; Val
Lys (K)             Arg; Gln; Glu
Met (M)             Leu; Tyr; Ile
Phe (F)             Met; Leu; Tyr
Ser (S)             Thr
Thr (T)             Ser
Trp (W)             Tyr
Tyr (Y)             Trp; Phe
Val (V)             Ile; Leu


Other substitutions are also permissible and may be determined empirically or in
accord with known conservative substitutions.
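The substitution table lends itself to a simple lookup. The following is a minimal sketch, assuming standard three-letter residue codes and treating the table as directional (a listed replacement for one residue does not imply the reverse). The dictionary and function names are illustrative, and the lookup covers only the listed pairs, not the empirically determined substitutions the text also permits.

```python
# Conservative substitutions transcribed from TABLE 2 (three-letter codes).
# Keys are original residues; values are the listed conservative replacements.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Ala", "Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Tyr", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_listed_conservative(original, replacement):
    """True if `replacement` is listed in TABLE 2 for `original`."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())


print(is_listed_conservative("Ala", "Ser"))
print(is_listed_conservative("Ala", "Trp"))
```

A residue absent from the table (for example Asp or Pro as an original residue) simply returns False here, which reflects the table's coverage rather than a statement about biological activity.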

As used herein, a complementing plasmid describes plasmid vectors that
deliver nucleic acids into a packaging cell line for stable integration into a
chromosome in the cellular genome.

As used herein, a delivery plasmid is a plasmid vector that carries or
delivers nucleic acids encoding a therapeutic gene or gene that encodes a
therapeutic product or a precursor thereof or a regulatory gene or other factor
that results in a therapeutic effect when delivered in vivo in or into a cell line,
such as, but not limited to a packaging cell line, to propagate therapeutic viral
vectors.

As used herein, a variety of vectors with different requirements are
described. For example, one vector is used to deliver particular nucleic acid
molecules into a packaging cell line for stable integration into a chromosome.
These types of vectors are generally identified herein as complementing
plasmids. A further type of vector described herein carries or delivers nucleic
acid molecules in or into a cell line (e.g., a packaging cell line) for the purpose of
propagating therapeutic viral vectors; hence, these vectors are generally referred
to herein as delivery plasmids. A third "type" of vector described herein is used
to carry nucleic acid molecules that encode therapeutic proteins or polypeptides or
regulatory proteins, or that are regulatory sequences, to specific cells or cell types in a
subject in need of treatment; these vectors are generally identified herein as
therapeutic viral vectors or recombinant adenoviral vectors or viral Ad-derived
vectors and are in the form of a virus particle encapsulating a viral nucleic acid
containing an expression cassette for expressing the therapeutic gene.

As used herein, a DNA or nucleic acid homolog refers to a nucleic acid
that includes a preselected conserved nucleotide sequence, such as a sequence
encoding a therapeutic polypeptide. By the term "substantially homologous" is
meant having at least 80%, preferably at least 90%, most preferably at least
95% homology therewith or a lesser percentage of homology or identity and
conserved biological activity or function.

The terms "homology" and "identity" are often used interchangeably. In
this regard, percent homology or identity may be determined, for example, by
comparing sequence information using a GAP computer program. The GAP
program utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol.
48:443 (1970)), as revised by Smith and Waterman (Adv. Appl. Math. 2:482
(1981)). Briefly, the GAP program defines similarity as the number of aligned
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total
number of symbols in the shorter of the two sequences. The preferred default
parameters for the GAP program may include: (1) a unary comparison matrix
(containing a value of 1 for identities and 0 for non-identities) and the weighted
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986),
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE
AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358
(1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each
symbol in each gap; and (3) no penalty for end gaps.
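The similarity ratio described above can be illustrated with a minimal sketch. It is not the GAP program: it assumes the two sequences have already been aligned (with "-" marking gap positions), applies only the unary comparison matrix (1 for identities, 0 otherwise), and ignores the gap penalties, which affect how the alignment is chosen rather than how the ratio is computed. The function name and the gap character are assumptions made for illustration.

```python
def gap_style_similarity(aligned_a, aligned_b, gap="-"):
    """Matching aligned symbols divided by the length of the shorter sequence.

    Both inputs must be pre-aligned strings of equal length; gap positions
    are marked with `gap` and never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Unary comparison matrix: 1 for an identity, 0 for anything else.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    # Ungapped lengths of the two original sequences.
    len_a = sum(1 for x in aligned_a if x != gap)
    len_b = sum(1 for y in aligned_b if y != gap)
    shorter = min(len_a, len_b)
    if shorter == 0:
        raise ValueError("each sequence must contain at least one symbol")
    return matches / shorter


# Four identical positions; the shorter sequence has five symbols.
print(gap_style_similarity("ACGT-A", "ACCTGA"))
```

Dividing by the shorter sequence means a short sequence fully contained in a longer one scores 1.0, which is the behaviour the definition in the text implies.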

Whether any two nucleic acid molecules have nucleotide sequences that
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be
determined using known computer algorithms such as the "FASTA" program,
using for example, the default parameters as in Pearson and Lipman, Proc. Natl.
Acad. Sci. USA 85:2444 (1988). Alternatively the BLAST function of the
National Center for Biotechnology Information database may be used to
determine identity.

In general, sequences are aligned so that the highest order match is
obtained. "Identity" per se has an art-recognized meaning and can be calculated
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk,
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics
and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993;
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G.,
eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular
Biology, von Heijne, G., Academic Press, 1987; and Sequence Analysis Primer,
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991).

While there exist a number of methods to measure identity between two
polynucleotides or polypeptide sequences, the term "identity" is well known to
skilled artisans (Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988)).
Methods commonly employed to determine identity or similarity between two
sequences include, but are not limited to, those disclosed in Guide to Huge
Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and
Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 (1988). Methods to
determine identity and similarity are codified in computer programs. Preferred
computer program methods to determine identity and similarity between two
sequences include, but are not limited to, GCG program package (Devereux, J.,
et al., Nucleic Acids Research 12(1):387 (1984)), BLASTP, BLASTN, FASTA
(Altschul, S.F., et al., J Molec Biol 215:403 (1990)).

Therefore, as used herein, the term "identity" represents a comparison
between a test and a reference polypeptide or polynucleotide. For example, a
test polypeptide may be defined as any polypeptide that is 90% or more
identical to a reference polypeptide. As used herein, the term "at least 90%
identical to" refers to percent identities from 90 to 99.99 relative to the
reference polypeptides. Identity at a level of 90% or more is indicative of the
fact that, assuming for exemplification purposes that a test and a reference
polypeptide each 100 amino acids in length are compared, no more than 10%
(i.e., 10 out of 100) of the amino acids in the test polypeptide differ from those
of the reference polypeptide. Similar comparisons may be made between test and
reference polynucleotides. Such differences may be represented as point
mutations randomly distributed over the entire length of an amino acid sequence
or they may be clustered in one or more locations of varying length up to the
maximum allowable, e.g., 10/100 amino acid difference (approximately 90%
identity). Differences are defined as nucleic acid or amino acid substitutions, or
deletions.
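The worked example above (at most 10 differences in 100 residues for 90% identity) can be expressed as a short check. This is a sketch under simplifying assumptions: the two sequences have equal length, only substitutions are counted (handling deletions would require an alignment step that is not shown), and the identity threshold is a whole-number percentage. The function names are illustrative.

```python
def max_differences(length, min_identity_pct):
    """Largest number of differing positions still meeting the threshold."""
    return length * (100 - min_identity_pct) // 100


def meets_identity(test_seq, ref_seq, min_identity_pct=90):
    """True if test_seq differs from ref_seq at no more than the allowed
    number of positions. Assumes equal lengths and substitutions only."""
    if len(test_seq) != len(ref_seq):
        raise ValueError("this sketch compares equal-length sequences only")
    differences = sum(1 for a, b in zip(test_seq, ref_seq) if a != b)
    return differences <= max_differences(len(ref_seq), min_identity_pct)


reference = "A" * 100
clustered = "G" * 10 + "A" * 90   # 10 differences clustered at one end
too_many = "G" * 11 + "A" * 89    # 11 differences

print(max_differences(100, 90))
print(meets_identity(clustered, reference))
print(meets_identity(too_many, reference))
```

Because only the count of differences matters, the check gives the same answer whether the differences are clustered, as here, or scattered along the sequence.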

As used herein, genetic therapy involves the transfer of heterologous
DNA to certain cells, target cells, of a mammal, particularly a human, with a
disorder or condition for which such therapy is sought. The DNA is introduced
into the selected target cells in a manner such that the heterologous DNA is
expressed and a therapeutic product encoded thereby is produced.
Alternatively, the heterologous DNA may in some manner mediate expression of
DNA that encodes the therapeutic product, or it may encode a product, such as a
peptide or RNA, that in some manner mediates, directly or indirectly, expression
of a therapeutic product. Genetic therapy may also be used to deliver nucleic
acid encoding a gene product to replace a defective gene or supplement a gene
product produced by the mammal or the cell in which it is introduced. The
introduced nucleic acid may encode a therapeutic compound, such as a growth
factor or inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a
receptor therefor, that is not normally produced in the mammalian host or that is
not produced in therapeutically effective amounts or at a therapeutically useful
time. The heterologous DNA encoding the therapeutic product may be modified
prior to introduction into the cells of the afflicted host in order to enhance or
otherwise alter the product or expression thereof.

As used herein, heterologous DNA is DNA that encodes RNA and proteins
that are not normally produced in vivo by the cell in which it is expressed or that
mediates or encodes mediators that alter expression of endogenous DNA by
affecting transcription, translation, or other regulable biochemical processes.
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one
of skill in the art would recognize or consider as heterologous or foreign to the
cell in which it is expressed is herein encompassed by heterologous DNA.
Examples of heterologous DNA include, but are not limited to, DNA that encodes
traceable marker proteins, such as a protein that confers drug resistance, DNA
that encodes therapeutically effective substances, such as anti-cancer agents,
enzymes and hormones, and DNA that encodes other types of proteins, such as
antibodies. Antibodies that are encoded by heterologous DNA may be secreted
or expressed on the surface of the cell in which the heterologous DNA has been
introduced.

Hence, herein heterologous DNA or foreign DNA refers to a DNA
molecule not present in the exact orientation and position as the counterpart
DNA molecule found in the corresponding wild-type adenovirus. It may also
refer to a DNA molecule from another organism or species (i.e., exogenous) or
from another Ad serotype.

As used herein, a therapeutically effective product is a product that is
encoded by heterologous DNA and that, upon introduction of the DNA into a host,
is expressed and effectively ameliorates or eliminates the symptoms or
manifestations of an inherited or acquired disease, or cures said disease.

Typically, DNA encoding the desired heterologous DNA is cloned into a
plasmid vector and introduced by routine methods, such as calcium-phosphate
mediated DNA uptake (see (1981) Somat. Cell. Mol. Genet. 7:603-616) or
microinjection, into producer cells, such as packaging cells. After amplification
in producer cells, the vectors that contain the heterologous DNA are introduced
into selected target cells.

As used herein, an expression or delivery vector refers to any plasmid or
virus into which a foreign or heterologous DNA may be inserted for expression in
a suitable host cell, i.e., the protein or polypeptide encoded by the DNA is
synthesized in the host cell's system. Vectors capable of directing the
expression of DNA segments (genes) encoding one or more proteins are referred
to herein as "expression vectors." Also included are vectors that allow cloning
of cDNA (complementary DNA) from mRNAs produced using reverse
transcriptase.

As used herein, a gene is a nucleic acid molecule whose nucleotide
sequence encodes RNA or polypeptide. A gene can be either RNA or DNA.
Genes may include regions preceding and following the coding region (leader and
trailer) as well as intervening sequences (introns) between individual coding
segments (exons).

As used herein, tropism with reference to an adenovirus refers to the
selective infectivity or binding that is conferred on the particle by the fiber
protein, such as by the C-terminus portion that comprises the knob.

As used herein, isolated with reference to a nucleic acid molecule or
polypeptide or other biomolecule means that the nucleic acid or polypeptide has
been separated from the genetic environment from which the polypeptide or nucleic
acid was obtained. It may also mean altered from the natural state. For
example, a polynucleotide or a polypeptide naturally present in a living animal is
not "isolated," but the same polynucleotide or polypeptide separated from the
coexisting materials of its natural state is "isolated," as the term is employed
herein. Thus, a polypeptide or polynucleotide produced and/or contained within
a recombinant host cell is considered isolated. Also intended as an "isolated
polypeptide" or an "isolated polynucleotide" are polypeptides or polynucleotides
that have been purified, partially or substantially, from a recombinant host cell or
from a native source. For example, a recombinantly produced version of a
compound can be substantially purified by the one-step method described in
Smith and Johnson, Gene 67:31-40 (1988). The terms isolated and purified are
sometimes used interchangeably.

Thus, by "isolated" is meant that the nucleic acid is free of the coding
sequences of those genes that, in the naturally-occurring genome of the
organism (if any), immediately flank the gene encoding the nucleic acid of
interest. Isolated DNA may be single-stranded or double-stranded, and may be
genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA. It may be
identical to a native DNA sequence, or may differ from such sequence by the
deletion, addition, or substitution of one or more nucleotides.

Isolated or purified as it refers to preparations made from biological cells
or hosts means any cell extract containing the indicated DNA or protein including
a crude extract of the DNA or protein of interest. For example, in the case of a
protein, a purified preparation can be obtained following an individual technique
or a series of preparative or biochemical techniques and the DNA or protein of
interest can be present at various degrees of purity in these preparations. The
procedures may include for example, but are not limited to, ammonium sulfate
fractionation, gel filtration, ion exchange chromatography, affinity
chromatography, density gradient centrifugation and electrophoresis.

A preparation of DNA or protein that is "substantially pure" or "isolated" 
should be understood to mean a preparation free from naturally occurring 
materials with which such DNA or protein is normally associated in nature. 
"Essentially pure" should be understood to mean a "highly" purified preparation 
that contains at least 95% of the DNA or protein of interest. 

A cell extract that contains the DNA or protein of interest should be 
understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term 
"cell extract" is intended to include culture media, especially spent culture media 
from which the cells have been removed. 

As used herein, a packaging cell line is a cell line that provides a missing 
gene product or its equivalent. 

As used herein, an adenovirus viral particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of 
particles or a viral genome. The adenovirus (Ad) particle is relatively complex 
and may be resolved into various substructures. 

As used herein, "penton" or "penton complex" is preferentially used 
to designate a complex of penton base and fiber. The term "penton" may 
also be used to indicate penton base, as well as penton complex. The meaning 
of the term "penton" alone should be clear from the context within which it is 
used. 

As used herein, a plasmid refers to an autonomous self-replicating 
extrachromosomal circular nucleic acid molecule, typically DNA. 

As used herein, a post-transcription regulatory element (PRE) is a 
regulatory element found in viral or cellular messenger RNA that is not spliced, 
i.e., intronless messages. Examples include, but are not limited to, human 
hepatitis virus, woodchuck hepatitis virus, the TK gene and mouse histone gene. 
The PRE may be placed before a polyA sequence and after a heterologous DNA 
sequence. 

As used herein, pseudotyping describes the production of adenoviral 
vectors having modified capsid protein or capsid proteins from a different 
serotype than the serotype of the vector itself. One example is the production 
of an adenovirus 5 vector particle containing an Ad37 fiber protein. This may be 
accomplished by producing the adenoviral vector in packaging cell lines 
expressing different fiber proteins. 

As used herein, promoters of interest may be inducible or 
constitutive. Inducible promoters will initiate transcription only in the presence 
of an additional molecule; constitutive promoters do not require the presence of 
any additional molecule to regulate gene expression. A regulable or inducible 
promoter may also be described as a promoter where the rate or extent of RNA 
polymerase binding and initiation is modulated by external stimuli. Such stimuli 
include, but are not limited to, various compounds or compositions, light, heat, 
stress and chemical energy sources. Inducible, suppressible and repressive 
promoters are considered regulable promoters. Preferred promoters herein are 
promoters that are selectively expressed in ocular cells, particularly 
photoreceptor cells. 

As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
protein" may be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

As used herein, recombinant refers to any progeny formed as the result of 
genetic engineering. This may also be used to describe a virus formed by 
recombination of plasmids in a packaging cell. 

As used herein, a transgene or therapeutic nucleic acid molecule includes 
DNA and RNA molecules encoding an RNA or polypeptide. Such molecules may 
be "native" or naturally-derived sequences; they may also be "non-native" or 
"foreign" sequences that are naturally or recombinantly derived. The term "transgene," 
which may be used interchangeably herein with the term "therapeutic nucleic 
acid molecule," is often used to describe a heterologous or foreign (exogenous) 
gene that is carried by a viral vector and transduced into a host cell. 

Therefore, therapeutic nucleic acid molecules include antisense 
sequences or nucleotide sequences which may be transcribed into antisense 
sequences. Therapeutic nucleotide sequences (or transgenes) all include nucleic 
acid molecules that function to produce a desired effect in the cell or cell nucleus 
into which said therapeutic sequences are delivered. For example, a therapeutic 
nucleic acid molecule can include a sequence of nucleotides that encodes a 
functional protein intended for delivery into a cell which is unable to produce 
that functional protein. 

As used herein, the vitreous of the eye refers to material that fills the 
chamber behind the lens of the eye (i.e., vitreous humor or vitreous body). 

As used herein, a promoter region refers to the portion of DNA of a gene 
that controls transcription of the DNA to which it is operatively linked. The 
promoter region includes specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. This portion of the 
promoter region is referred to as the promoter. In addition, the promoter region 
includes sequences that modulate this recognition, binding and transcription 
initiation activity of the RNA polymerase. These sequences may be cis acting or 
may be responsive to trans acting factors. Promoters, depending upon the 
nature of the regulation, may be constitutive or regulated. 

WO 01/83729 
PCT/EP01/04863 

Thus, promoters are nucleic acid fragments that contain a DNA sequence 
that controls the expression of a gene located 3' or downstream of the 
promoter. The promoter is the DNA sequence to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene, 
typically located 3' of the promoter. A promoter also includes DNA sequences 
that direct the initiation of transcription, including those to which RNA 
polymerase specifically binds. If more than one nucleic acid sequence encoding a 
particular polypeptide or protein is included in a therapeutic viral vector or 
nucleotide sequence, more than one promoter or enhancer element may be 
included, particularly if that would enhance efficiency of expression. 

A regulable or inducible promoter may be described as a promoter 
wherein the rate of RNA polymerase binding and initiation is modulated by 
external stimuli (see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,205). Such 
stimuli include various compounds or compositions, light, heat, stress, chemical 
energy sources, and the like. Inducible, suppressible and repressive promoters 
are considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. 
Tissue-specific promoters direct the expression of the gene to which they are 
operably linked to a specific cell type. A tissue-specific promoter causes the gene 
located 3' of it to be expressed predominantly, if not exclusively, in the specific 
cells where the promoter expresses its endogenous gene. Typically, it appears 
that if a tissue-specific promoter expresses the gene located 3' of it at all, then 
it is expressed appropriately in the correct cell types (see, e.g., Palmiter et al. 
(1986) Ann. Rev. Genet. 20:465-499). 

As used herein, the phrase "operatively linked" generally means the 
sequences or segments have been covalently joined into one piece of DNA, 
whether in single or double stranded form, whereby control sequences on one 
segment control expression or replication or other such control of other 
segments. The two segments are not necessarily contiguous. 

As used herein, exogenous encompasses any therapeutic composition 
that is administered by the therapeutic methods provided herein. Thus, 
exogenous may also be referred to herein as foreign, or non-native or other 
equivalent expression. 

B. Ad37 fiber tropism 

The adenovirus fiber protein is a major determinant of adenovirus tropism 
(Gall et al. (1996) J. Virol. 70:2116-2123; Stevenson et al. (1995) J. Virol. 
69:2850-2857). The fiber protein extends from the capsid and mediates viral 
binding to the cell surface by binding to specific cell receptors (Philipson et al. 
(1968) J. Virol. 2:1064-1075). The fiber is a trimeric protein that includes an 
N-terminal tail domain that interacts with the adenovirus penton base, a central 
shaft domain of varying length, and a C-terminal knob domain that contains the 
cell receptor binding site (Chroboczek et al. (1995) Curr. Top. Microbiol. Immunol. 
199:163-200; Ruigrok et al. (1990) J. Mol. Biol. 215:589-596; Stevenson et al. 
(1995) J. Virol. 69:2850-2857). Fiber proteins of most adenovirus subgroups 
have been shown to bind specifically or selectively to the 46 kDa 
coxsackievirus-adenovirus receptor (CAR) (Bergelson et al. (1997) Science 275:1320-1323; 
Roelvink et al. (1998) J. Virol. 72:7909-7915). CAR appears to be expressed in 
a variety of human tissues, including the lung, at various levels (Bergelson et al. 
(1997) Science 275:1320-1323), but Ad37 binds poorly to lung epithelial cells 
(Huang et al. (1999) J. Virol. 73:2798-2802). This suggests that the tropism of 
this serotype may be influenced by factors independent of CAR expression. 

Structural and biochemical data also suggest that distinct receptor binding 
sites are located on different regions of the Ad5 and Ad37 fiber knobs. 
Adopting the nomenclature of Xia et al. (Xia et al. (1994) Structure 2:1259-1270), 
the receptor binding site for Ad5 is located at the AB-loop on the side of 
the fiber knob (Bewley et al. (1999) Science 286:1579-1583; Roelvink et al. 
(1999) Science 286:1568-1571). It is known that a lysine residue at position 
240 of the Ad37 fiber, located in the CD-loop, is important for receptor binding 
(Huang et al. (1999) J. Virol. 73:2798-2802). The co-crystal structure of the 
Ad12 knob and the N-terminal domain of CAR (Bewley et al. (1999) Science 
286:1579-1583) shows that the CD-loop does not contact CAR. It thus appears 
that different regions of the Ad5 and Ad37 fiber knobs recognize distinct cell 
receptors. 

A 46 kDa receptor for coxsackieviruses and adenoviruses (CAR) mediates 
attachment for many adenovirus serotypes. The wide distribution of CAR fails 
to explain why certain adenovirus serotypes (i.e., Ad37) are highly associated 
with severe ocular infections such as epidemic keratoconjunctivitis (EKC). Ad37 
does not use CAR, but instead uses a glycoprotein that contains sialic acid as its 
primary receptor (Arnberg et al. (2000) J. Virol. 74:42-48). The modest 
number of Ad37 binding sites per cell (Huang et al. (1999) J. Virol. 73:2798-2802) 
also suggests that Ad37 recognizes a specific glycoprotein as its primary 
receptor for binding to conjunctival cells. 

Adenovirus type 37 (subgroup D) has been associated with infections of 
the eye and genital tract. The tropism of Ad37 derives from the binding 
preference of its fiber protein, which binds to a receptor located on the surface 
of cells including Chang C, a conjunctival epithelial cell line (Huang et al. (1999) J. 
Virol. 73:2798-2802). 

A protein receptor that is preferentially expressed on conjunctival cells and to 
which Ad37 fiber binds is shown herein. The preferential expression of the 
Ad37 receptor protein on conjunctival cells suggests that this receptor likely 
influences Ad37 tropism and should play a key role in ocular pathogenesis. It is 
shown herein that Ad37 uses a distinct protein receptor that is selectively 
expressed on conjunctival cells. It is shown that Ad37 binds well to conjunctival 
cells (Chang C), but poorly to lung carcinoma cells (A549). To determine if 
infection correlated with cell binding, an Ad5 vector containing the Ad37 fiber 
protein was constructed. The "pseudotyped" vector delivered transgenes to 
Chang C cells better than to A549 cells. Ad37 binding was abolished by 
protease treatment of Chang C cells, indicating the receptor is a membrane 
protein. Ad37 binding to conjunctival cells is shown herein to be 
calcium-dependent. It is also shown that Ad37 infection was not inhibited by a 
function-blocking anti-CAR monoclonal antibody, which is a feature distinct from Ad5 
fiber interaction with CAR. Using a virus overlay protein blot assay (VOPBA), 
calcium-dependent Ad37 binding to a 50 kDa membrane protein on Chang C 
cells, but not A549 cells, was detected. Ad19p, a closely related serotype that 
fails to bind to conjunctival cells, does not recognize the 50 kDa protein. 
Together, these data indicate that the 50 kDa protein is a candidate receptor for 
Ad37 on conjunctival cells. 

Significantly, it is also shown herein that, upon administration of the 
vector to the vitreous humor, the recombinant adenovirus with the Ad37 fiber 
preferentially and selectively binds to photoreceptor cells. Hence, a recombinant 
adenoviral delivery vehicle that has an Ad37 fiber protein can serve as a vector 
for delivery of therapeutic agents to the eye for treatment of ocular disorders, 
including genetic and acquired disorders. The identification of the receptor for 
Ad37 and the resulting recognition of Ad37 tropism allows targeting of 
adenovirus vectors to specific human ocular cells. 

As noted, fiber plays a crucial role in adenovirus infection by attaching 
the virus to a specific receptor on a cell surface. Hexon, penton and fiber 
capsomeres are the major components on the surface of the virion. The fiber is 
an elongated protein which exists as a trimer of three identical polypeptides 
(polypeptide IV) of 582 amino acids in length. An adenovirus fiber includes 
three domains: an N-terminal tail domain that interacts with penton base; a 
shaft composed of variable numbers of repeats of a 15-amino-acid segment that 
forms beta-sheet and beta-bends; and a knob at the C-terminus ("head domain") 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. The gene encoding the fiber protein from Ad2 has been 
expressed in human cells and has been shown to be correctly assembled into 
trimers, glycosylated and transported to the nucleus (see, e.g., Hong and Engler, 
Virology 185:758-761, 1991). Thus, alteration of the fiber in recombinant Ad 
vectors can lead to alteration in gene delivery. 

As shown herein, alteration of fiber in recombinant Ad vectors such that 
the fiber is derived from Ad37 or another adenovirus serotype D provides a 
means for selective delivery of a recombinant virus to particular cells in the eye, 
including conjunctival cells, and most significantly photoreceptors, thereby 
providing a means for targeted delivery to photoreceptor cells. 

Photoreceptor cells are implicated in a number of hereditary and acquired 
retinal degenerative disorders. In addition, photoreceptor cells are located such 
that products produced therein can be delivered to other areas of the eye by 
virtue of the blood flow in the vicinity of the photoreceptor cells and also by 
virtue of the proximity of the photoreceptors to the retinal pigmented epithelium 
(RPE) and other retinal cells. 

Hence it is contemplated herein that the recombinant viral vector will 
include a packaged recombinant adenovirus genome containing at least the 
minimal elements for replication and packaging; heterologous DNA encoding a 
desired gene product, typically a therapeutic product or plurality of products, 
such as several trophic factors, whose combined activity is effective for treating 
a disorder, such as a retinal degenerative disorder; and the resulting virion 
particles will include a fiber that has a sufficient portion to confer specific 
targeting to photoreceptor cells when the recombinant viral particles are 
introduced into the aqueous humor of a mammalian, preferably a human, eye, or 
otherwise contacted with the photoreceptor cells. The fiber may be a chimeric 
protein that has been modified for effective interaction with other coat structural 
proteins, such as penton. In addition, the fiber may be modified to include other 
elements that alter its tropism to permit binding to other cells as well (see, e.g., 
U.S. Patent Nos. 5,756,086 and 5,543,328; International PCT application Nos. 
WO 95/26412 and WO 98/44121; and Krasnykh et al., J. Virol. 70:6839-46, 
1996). 

C. Construction of the viral particles 

1. Selection of viral genome and fiber protein 

Methods for preparing recombinant adenoviral vectors for gene 
product delivery are well known. Preferred among those are the methods 
exemplified herein (see EXAMPLES) and also described in copending U.S. 
application Serial No. 09/482,682 (also filed as International PCT application 
No. PCT/US00/00265, filed January 14, 2000, which claims priority to U.S. 
provisional application Serial No. 60/115,920, as does U.S. application Serial 
No. 09/482,682). 

As noted, any desired recombinant adenovirus is contemplated for use in 
the methods herein as long as the viral genome is packaged in a capsid that 
includes at least the portion of a fiber protein that provides selective binding to 
photoreceptor cells. This fiber protein is preferably from an adenovirus type D 
serotype and is preferably an Ad37 fiber. The fiber protein should retain the 
knob region at the C-terminus ("head domain") from the Ad virus of subgroup D 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. Hence the fiber protein can be a chimeric fiber protein as long 
as it retains a sufficient portion of the type D serotype to specifically or 
selectively bind to photoreceptor cells. Generally the portion retained will be all 
or a portion of the knob region. The precise amount of knob region required can 
be determined empirically by including portions thereof and identifying the 
minimum residues from an Ad type D serotype, preferably Ad37, to effect 
selective targeting of a virion packaged with such fiber to photoreceptors in the 
eye upon introduction of the packaged virion into the aqueous humor. 

Recombinant adenovirus containing heterologous nucleic acids that 
encode a desired product, such as a gene to correct a genetic defect, may be made 
by any methods known to those of skill in the art. The viruses must be 
packaged in a cell line that results in expression of fiber on the particles that 
specifically, selectively or preferentially targets (binds and results in 
internalization) the viral particle to cells in the eye. The fiber protein from Ad37 
and other adenoviruses of serotype D that infect the eye effects such targeting. 
The resulting adenovirus particles that express such fiber are administered by 
intraocular injection, subretinal injection, particularly intravitreal injection, or any 
means that results in preferential accumulation in photoreceptor cells. 

The family of Adenoviridae includes many members with at least 47 
known serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter 
67, in Fields et al., eds. Lippincott-Raven, Philadelphia, 1996) as well as 
members of the genus Mastadenovirus including human, simian, bovine, equine, 
porcine, ovine, canine and opossum viruses and members of the Aviadenovirus 
genus, including bird viruses, such as CELO. 







Thus it is contemplated that the methods herein can be applied to any 
recombinant viral vectors derived from any adenovirus species. One of skill in 
the art would have knowledge of the different adenoviruses (see, e.g., Shenk, 
Virology, Chapter 67, in Fields et al., eds. Lippincott-Raven, Philadelphia, 1996) 
and can construct recombinant viruses containing portions of the genome of any 
such virus. 

In the exemplified embodiment, viral particles with Ad37 fiber were 
prepared. Site-directed mutations were made to the Ad37 fiber gene to make 
the tail sequence more closely match that of Ad5 to facilitate Ad37 fiber binding 
to the Ad5 penton base. The plasmid for the expression of the Ad37 fiber 
protein, pDV80, contains the CMV promoter, the adenovirus type 5 tripartite 
leader (TPL), and the modified Ad37 fiber gene sequence. Genes of interest, 
such as nucleic acid encoding the β subunit of cGMP phosphodiesterase (βPDE), 
β-glucuronidase, rhodopsin, growth factors, anti-cancer agents, growth factor 
receptors and other anti-angiogenic agents, and anti-apoptotic agents, can be 
incorporated into these vectors using the methods known to those of skill in the 
art and exemplified herein. 

Known adenovirus vectors, previously constructed for intraocular therapy 
(see, e.g., Bennett et al. (1996) Nature Medicine 2:649-654, which provides an 
Ad virus encoding βPDE for treatment of retinitis pigmentosa; Cayouette et al. 
(1998) Human Gene Therapy 8:423-430, which provides an Ad vector that 
expresses CNTF for treatment of retinitis pigmentosa and other retinal 
degenerative diseases; and Li et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 
92:7700-7704, which provides an Ad virus vector that encodes a human 
β-glucuronidase for treatment of lysosomal storage disease caused by 
β-glucuronidase deficiency) can be modified by repackaging the recombinant 
genome using a packaging line that expresses an Ad37 fiber or other D serotype 
fiber. 

For exemplification, nucleic acid encoding GFP was incorporated into 
these vectors as a means to visualize their localization. Other genes, such as 
genes that encode therapeutic products, may be included in place of or in addition 
to GFP. 

Plasmid pDV80 was electroporated into E1-2a S8 cells and stable lines 
were selected. The fiber-deleted vectors Ad5.βgal.ΔF and Ad5.GFP.ΔF were 
grown in cells of a resulting cell line, designated 705, to produce virions, which 
express the Ad37 fiber (Ad5.βgal.ΔF/37F and Ad5.GFP.ΔF/37F) and were 
CsCl-purified. These virions selectively transduce photoreceptor cells when injected 
intraocularly into the vitreous humor. 

2. Packaging 

Recombinant adenoviral vectors generally have at least a deletion in the 
first viral early gene region, referred to as E1, which includes the E1a and E1b 
regions. Deletion of the viral E1 region renders the recombinant adenovirus 
defective for replication and incapable of producing infectious viral particles in 
subsequently-infected target cells. Thus, replication of an E1-deleted adenovirus 
genome and production of virus particles require a system of 
complementation that provides the missing E1 gene product. E1 
complementation is typically provided by a cell line expressing E1, such as the 
human embryonic kidney packaging cell line (i.e., an epithelial cell line) called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1-deleted virus in the cell line 
(see, e.g., Graham et al., J. Gen. Virol. 36:59-71, 1977). Additionally, cell lines 
that may be usable for production of defective adenovirus having a portion of 
the adenovirus E4 region have been reported (WO 96/22378). 

Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

Copending U.S. application Serial No. 09/482,682 (also filed as 
International PCT application No. PCT/US00/00265, filed January 14, 2000) 
provides packaging cell lines that support viral vectors with deletions of major 
portions of the viral genome, without the need for helper viruses, and also 
provides cell lines and helper viruses for use with helper-dependent vectors. 

The packaging cell line has heterologous DNA stably integrated into the 
chromosomes of the cellular genome. The heterologous DNA sequence encodes 
one or more adenovirus regulatory and/or structural polypeptides that 
complement the genes deleted or mutated in the adenovirus vector genome to 
be replicated and packaged. The packaging cell line expresses, for example, one 
or more adenovirus structural proteins, polypeptides, or fragments thereof, such 
as penton base, hexon, fiber, polypeptide IIIa, polypeptide V, polypeptide VI, 
polypeptide VII, polypeptide VIII, and biologically active fragments thereof. The 
expression can be constitutive or under the control of a regulable promoter. 
These cell lines are designed for expression of recombinant adenoviruses 
intended for delivery of therapeutic products. 

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides E1A and E1B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing each DNA molecule 
into the cells and then into the genome via a separate complementing plasmid, or a 
plurality of DNA molecules encoding the complementing proteins can be 
introduced via a single complementing plasmid. Of interest herein is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof) from an Ad virus of subgroup D, 
such as Ad37, or a polypeptide or fragment thereof. 

For therapeutic applications, the delivery plasmid further includes a 
nucleotide sequence encoding a foreign polypeptide. Exemplary delivery 
plasmids include, but are not limited to, pDV44, pΔE1Bβ-gal and pΔE1sp1B. In 
a similar or analogous manner, therapeutic genes may be introduced. 

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
chromosome(s) of the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an 
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable 
of directing the packaging of new viral particles in the absence of a second 
delivery plasmid, and a second delivery plasmid comprises an adenoviral genome 
capable of directing the packaging of new viral particles in the presence of the 
first delivery plasmid. 

In a variation, the packaging cell line expresses fiber protein or a chimeric 
variant thereof from an Ad virus of subgroup D, preferably the Ad37 serotype, or it 
can be any fiber protein that has been modified to include the portion of 
the Ad virus of subgroup D, such as Ad37, responsible for selective targeting to 
photoreceptors upon introduction into the vitreous humor of the eye of a 
mammal, preferably a human. The fiber protein can be further modified to 
include a non-native amino acid residue sequence that targets additional specific 
receptors. In all instances, the modification should not disrupt trimer formation 
or transport of fiber into the nucleus. In another variation, the non-native amino 
acid residue sequence alters the binding specificity of the fiber for a targeted cell 
type. The fiber structural protein can include amino acid residue sequences 
from more than one adenovirus serotype. The nucleotide sequences encoding 
fiber protein or polypeptide need not be modified solely at one or both termini; 
fiber protein may be modified "internally" as well as at the termini. 

Additional nucleic acid fragments can encode polypeptides that are 
added to the fiber protein. In one variation, the non-native amino acid residue 
sequence is coupled to the carboxyl terminus of the fiber. In another, the 
non-native amino acid residue sequence further includes a linker sequence. 
Alternatively, the fiber protein further comprises a ligand coupled to the linker. 
Suitable ligands include, but are not limited to, ligands that specifically or 
selectively bind to a cell surface receptor and ligands that can be used to couple 
other proteins or nucleic acid molecules. Typically, the packaging cell lines will 
contain nucleic acid encoding the fiber protein or modified protein stably 
integrated into a chromosome or chromosomes in the cellular genome. 

The packaging cell line can be derived from a procaryotic cell line or from 
a eukaryotic cell line. While various embodiments suggest the use of mammalian 
cells, and more particularly, epithelial cell lines, a variety of other, non-epithelial 
cell lines are used in various embodiments. Thus, while various embodiments 
disclose the use of a cell line selected from among the 293, A549, W162, HeLa, 
Vero, 211, and 211A cell lines, any other cell lines suitable for such use are 
likewise contemplated herein. 

3. Components of the nucleic acid molecule included in the particle 

A recombinant viral vector or therapeutic viral vector for use in the 
methods herein typically includes a nucleic acid fragment that encodes a 
protein or polypeptide molecule, or a biologically active fragment thereof, or 
other regulatory sequence, that is intended for use in therapeutic applications. 

The nucleic acid molecule to be packaged in the viral particle also may 
include an enhancer element and/or a promoter located 3' or 5' to and 
controlling the expression of the therapeutic product-encoding nucleic acid 
molecule if the product is a protein. Further, for purposes herein, the promoter 
and/or other transcriptional and translational regulatory sequences controlling 
expression of the product is preferably one that is expressed specifically in the 
targeted cells, such as a photoreceptor-specific promoter, for example a 
rhodopsin gene promoter. 

The nucleic acid molecule to be packaged in the viral capsid includes at least 
2 different operatively linked DNA segments. The DNA can be manipulated and 
amplified by PCR as described herein and by using standard techniques, such as 
those described in Molecular Cloning: A Laboratory Manual, 2nd Ed., Sambrook 
et al., eds., Cold Spring Harbor, New York (1989). Typically, to produce such a 
molecule, the sequence encoding the selected polypeptide and the promoter or 
enhancer are operatively linked to a DNA molecule capable of autonomous 
replication in a cell either in vivo or in vitro. By operatively linking the enhancer 
element or promoter and nucleic acid molecule to the vector, the attached 
segments are replicated along with the vector sequences. 

Thus, the recombinant DNA molecule (rDNA) is a hybrid DNA molecule 
comprising at least 2 nucleotide sequences not normally found together in 
nature. In various preferred embodiments, one of the sequences is a sequence 
encoding an Ad-derived polypeptide, protein, or fragment thereof. The nucleic 
acid molecule intended to be packaged is from about 20 base pairs to about 
40,000 base pairs in length, preferably about 50 bp to about 38,000 bp in 
length. In various embodiments, the nucleic acid molecule is of sufficient length 
to encode one or more adenovirus proteins or functional polypeptide portions 
thereof. Since individual Ad polypeptides vary in length from about 19 amino 
acid residues to about 967 amino acid residues, encoding nucleic acid molecules 
are from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral 
vectors by therapeutic product-encoding nucleic acid molecules. 

Preferably the molecule includes an adenovirus tripartite leader (TPL)
nucleic acid sequence operatively linked to an intron containing RNA processing
signals (such as, for example, splice donor or splice acceptor sites) suitable for
expression in the packaging cell line. Most preferably the intron contains a splice
donor site and a splice acceptor site. Alternatively, the TPL nucleotide sequence
may not comprise an intron. The intron includes any sequence of
nucleotides that functions in the packaging cell line to provide RNA processing
signals, including splicing signals. Introns have been well characterized from a
large number of structural genes, and include but are not limited to a native
intron 1 from adenovirus, such as Ad5's TPL intron 1; others include the SV40
VP intron, the rabbit beta-globin intron, and synthetic intron constructs (see,
e.g., Petitclerc et al. (1995) J. Biotechnol. 40:169; and Choi et al. (1991) Mol.
Cell. Biol. 11:3070).

The nucleic acid molecule encoding the TPL includes either (a) first and
second TPL exons or (b) first, second and third TPL exons, where each TPL exon
in the sequence is selected from among the complete TPL exon 1, partial TPL
exon 1, complete TPL exon 2 and complete TPL exon 3. A complete exon is
one which contains the complete nucleic acid sequence based on the sequence
found in the wild type viral genome. Preferably the TPL exons are from Ad2,
Ad3, Ad5, Ad7 and the like; however, they may come from any Ad serotype, as
described herein. A preferred partial TPL exon 1 is described in the Examples.
The use of a TPL with a partial exon 1 has been reported (International PCT
application No. WO 98/13499).

The intron and the TPL exons can be operatively linked in a variety of
configurations to provide a functional TPL nucleotide sequence. An intron need
not be a part of the construct. When present, the intron can be positioned, for
example, between any of TPL exons 1, 2 or 3, and the exons can be in any order
of first and second, or first/second/third. The intron can also be placed
preceding the first TPL exon or following the last TPL exon. In a preferred
embodiment, complete TPL exon 1 is operatively linked to complete TPL exon 2
operatively linked to complete TPL exon 3. In a preferred variation, adenovirus
TPL intron 1 is positioned between complete TPL exon 1 and complete TPL exon 2.
It may also be possible to use analogous translational regulators from other
viral systems such as rabiesvirus.

A preferred "complete" TPL nucleic acid molecule containing complete
TPL exons 1, 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2
has a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL
nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2
and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The
construction of these preferred TPL nucleotide sequences is described in the
Examples.

Thus, preferred expression cassettes and complementing plasmids for
expressing adenovirus structural genes, particularly fiber protein, contain an
adenovirus TPL nucleotide sequence as described herein.

4. Complementing Plasmids 

Also contemplated is the use of nucleic acid molecules, typically in the
form of DNA plasmid vectors, which are capable of expression of an adenovirus
structural protein or regulatory protein. Because these expression plasmids are
used to complement the defective genes of a recombinant adenovirus vector
genome, the plasmids are referred to as complementing or complementation
plasmids.

The complementing plasmid contains an expression cassette, a nucleotide
sequence capable of expressing a protein product encoded by the nucleic acid
molecule. Expression cassettes typically contain a promoter and a structural
gene operatively linked to the promoter. The complementing plasmid can further
include a TPL nucleotide sequence to enhance expression of the structural gene
product when used in the context of adenovirus genome replication and
packaging.

A complementing plasmid can include a promoter operatively linked to a
sequence of nucleotides encoding an adenovirus structural polypeptide, such as,
but not limited to, penton base; hexon; fiber; polypeptide IIIa; polypeptide V;
polypeptide VI; polypeptide VII; polypeptide VIII; and biologically active
fragments thereof. In another variation, a complementing plasmid may also
include a sequence of nucleotides encoding a first adenovirus regulatory
polypeptide, a second regulatory polypeptide, and/or a third regulatory
polypeptide, and any combination of the foregoing.

Plasmid pDV80 is a preferred plasmid herein. Other plasmids constructed 
in an analogous manner to encode modified fiber proteins and chimeric fiber 
proteins are also contemplated herein. 

5. Nucleic Acid Molecule Synthesis 

A nucleic acid molecule comprising synthetic oligonucleotides can be
prepared using any suitable method, such as the phosphotriester or
phosphodiester methods (see, e.g., Narang et al. (1979) Meth. Enzymol.
68:90; U.S. Patent No. 4,356,270; and Brown et al. (1979) Meth. Enzymol.
68:109). For a family of oligonucleotides, the family members can be
synthesized simultaneously in a single reaction vessel, or can be synthesized
independently and later admixed in preselected molar ratios. For simultaneous
synthesis, the nucleotide residues that are conserved at preselected positions of
the sequence of the family member can be introduced in a chemical synthesis
protocol simultaneously to the variants by the addition of a single preselected
nucleotide precursor to the solid phase oligonucleotide reaction admixture when
that position number of the oligonucleotide is being chemically added to the
growing oligonucleotide polymer. Nucleotide residues at those positions in the
sequence that vary can be introduced simultaneously by the
addition of amounts, preferably equimolar amounts, of multiple preselected
nucleotide precursors to the solid phase oligonucleotide reaction admixture
during chemical synthesis. For example, where all four possible natural
nucleotides (A, T, G and C) are to be added at a preselected position, their
precursors are added to the oligonucleotide synthesis reaction at that step to
simultaneously form four variants (see, e.g., Ausubel et al., Current Protocols in
Molecular Biology, Suppl. 8, p. 2.11.7, John Wiley & Sons, Inc., New York,
1991).
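
The mixed-precursor step described above can be illustrated with a short
sketch. It assumes, purely for illustration, that a family is written as a
string in which the letter N marks a position where all four precursors are
added together; the function name, the N notation and the example sequence
are hypothetical and are not taken from this application.

```python
from itertools import product

# Hypothetical notation: 'N' marks a position at which all four natural
# nucleotide precursors are added together; every other letter is a
# conserved position that receives a single preselected precursor.
VARIABLE = {"N": "ATGC"}


def expand_family(template):
    """Return every oligonucleotide encoded by a template string."""
    choices = [VARIABLE.get(base, base) for base in template]
    return ["".join(member) for member in product(*choices)]


family = expand_family("GANTC")  # illustrative sequence only
print(family)
```

One variable position yields four family members, and each additional
variable position multiplies the family size by four.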
Nucleotide bases other than the common four nucleotides (A, T, G or C), or
the RNA equivalent nucleotide uracil (U), can also be used. For example, it is
well known that inosine (I) is capable of hybridizing with A, T and G, but not C.
Examples of other useful nucleotide analogs are known in the art and may be
found referred to in 37 C.F.R. §1.822.

Thus, where all four common nucleotides are to occupy a single position
of a family of oligonucleotides, that is, where the preselected nucleotide
sequence is designed to contain oligonucleotides that can hybridize to four
sequences that vary at one position, several different oligonucleotide structures
are contemplated. The composition can contain four members, where a
preselected position contains A, T, G or C. Alternatively, a composition can
contain two nucleotide sequence members, where a preselected position
contains I or C, and has the capacity to hybridize, at that position, to all four
possible common nucleotides. Finally, other nucleotides may be included at the
preselected position that have the capacity to hybridize in a non-destabilizing
manner with more than one of the common nucleotides in a manner similar to
inosine.

Similarly, larger nucleic acid molecules can be constructed in synthetic
oligonucleotide pieces, and assembled by complementary hybridization and
ligation, as is well known.

D. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus particle
and that expresses exogenous genes in a gene therapy setting is a key
component of the system. Thus, a recombinant adenovirus vector genome has
the ability to express selected adenovirus structural genes and to express a
desired exogenous protein, and contains sufficient replication and packaging
signals that the genome is packaged into a gene delivery vector particle. The
preferred replication signal is an adenovirus inverted terminal repeat
containing an adenovirus origin of replication, as is well known
and described herein.

Although adenoviruses include many proteins, not all adenovirus proteins
are required for assembly of a recombinant adenovirus particle (vector). Thus,
deletion of the appropriate genes from a recombinant Ad vector permits
accommodation of even larger "foreign" DNA segments.

A preferred recombinant adenovirus vector genome is "helper
independent" so that the genome can replicate and be packaged without the help
of a second, complementing helper virus. Complementation is provided by a
packaging cell.

In a preferred embodiment, the adenovirus vector genome does not
encode a functional adenovirus fiber protein. A non-functional fiber gene refers
to a deletion, mutation or other modification to the adenovirus fiber gene such
that the gene expresses no adenovirus fiber protein, or insufficient adenovirus
fiber protein, to package a fiber-containing adenovirus particle without
complementation of the fiber gene by a complementing plasmid or packaging
cell line. Such a genome is referred to as a "fiberless" genome, not to be
confused with a fiberless particle. Alternatively, a fiber protein may be encoded
but is insufficiently expressed to result in a fiber-containing particle.

Thus, contemplated for use are helper-independent fiberless recombinant
adenovirus vector genomes that include genes that (a) express all adenovirus
structural gene products but express insufficient adenovirus fiber protein to
package a fiber-containing adenovirus particle without complementation of said
fiber gene, (b) express an exogenous protein, and (c) contain an adenovirus
packaging signal and inverted terminal repeats containing adenovirus origin of
replication.

The adenovirus vector genome is propagated in the laboratory in the form
of rDNA plasmids containing the genome, and upon introduction into an
appropriate host, the viral genetic elements provide for viral genome replication
and packaging rather than plasmid-based propagation. Exemplary methods for
preparing an Ad-vector genome are described in the Examples.

A vector herein includes a nucleic acid (preferably DNA) molecule capable
of autonomous replication in a cell and to which a DNA segment, e.g., a gene or
polynucleotide, can be operatively linked to bring about replication of the
attached segment. For purposes herein, one of the nucleotide segments to be
operatively linked to vector sequences encodes at least a portion of a therapeutic
nucleic acid molecule. As noted above, therapeutic nucleic acid molecules
include those encoding proteins and also those that encode regulatory factors
that can lead to expression or inhibition or alteration of expression of a gene
product in a targeted cell.

1. Nucleic Acid Gene Expression Cassettes

In various embodiments, a peptide-coding sequence of the therapeutic
gene is inserted into an expression vector and expressed; however, it is also
feasible to construct an expression vector which also includes some non-coding
sequences as well. Preferably, however, non-coding sequences are excluded.
Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be
utilized. Another preferred therapeutic viral vector includes a nucleotide
sequence encoding at least a portion of a therapeutic nucleotide sequence
operatively linked to the expression vector for expression of the coding sequence
in the therapeutic nucleotide sequence.

The choice of viral vector into which a therapeutic nucleic acid molecule
is operatively linked depends directly, as is well known in the art, on the
functional properties desired, e.g., vector replication and protein expression, and
the host cell to be transformed - these being limitations inherent in the art of
constructing recombinant DNA molecules. Although certain adenovirus
serotypes are recited herein in the form of specific examples, it should be
understood that the use of any adenovirus serotype, including hybrids and
derivatives thereof, is contemplated.

A translatable nucleotide sequence is a linear series of nucleotides that
provides an uninterrupted series of at least 8 codons that encode a polypeptide in
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The
vector itself may be of any suitable type, such as a viral vector (RNA or DNA),
naked straight-chain or circular DNA, or a vesicle or envelope containing the
nucleic acid material and any polypeptides that are to be inserted into the cell.

2. Promoters

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector may also include a promoter, particularly a tissue or cell specific 
promoter, preferably one expressed in ocular cells, particularly photoreceptors. 

Promoters contemplated for use herein include regulatable (inducible) as
well as constitutive promoters, which may be used either on separate vectors or
on the same vector. Some useful regulatable promoters are those of the
CREB-regulated gene family and include inhibin, gonadotropin, cytochrome c,
glucagon, and the like (see, e.g., International PCT application No. WO
96/14061). Preferably the promoter selected is from a photoreceptor-specific
gene, such as a rhodopsin gene or gene that encodes a protein that regulates
rhodopsin expression.

E. Formulation and administration 

Compositions containing therapeutically effective concentrations of
recombinant adenovirus delivery vectors are provided. These are for delivery of
therapeutic gene products to cells, particularly cells that express a particular
50 kDa receptor or other receptor with which the vectors interact. These cells
include cells of the eye and genital tract. Of particular interest are
photoreceptor cells of the eye. Administration is effected by any means through
which contacting with the photoreceptors is effected. Preferable modes of
administration include, but are not limited to, subretinal injection, particularly
intravitreal injection, to provide access to photoreceptor cells.

The recombinant viral compositions may also be formulated for
implantation into the anterior or posterior chamber of the eye, preferably the
vitreous cavity, in sustained release formulations, such as those adsorbed to
biodegradable supports, including collagen sponges, or in liposomes. Sustained
release formulations may be formulated for multiple dosage administration, so
that during a selected period of time, such as a month or up to about a year,
several dosages are administered. Thus, for example, liposomes may be
prepared such that a total of about two to up to about five or more times the
single dosage is administered in one injection.

The vectors are formulated in an ophthalmologically acceptable carrier for
intraocular, preferably intravitreal, administration in a volume of between about
0.05 ml and 0.150 ml, preferably between about 0.05 ml and 0.100 ml.

The composition can be provided in a sealed sterile vial containing an
amount of a compound of formula I, that upon intraocular administration will
deliver a sufficient amount of viral particles to the photoreceptors in a volume of
about 50 to 150 µl, containing at least about 10⁷, more preferably at least about
10⁸ plaque forming units in such volume. Typically, the vials will, thus, contain
about 0.150 ml of the composition.
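
As a worked illustration of the figures recited above, the minimum stock titer
needed to deliver a given number of plaque forming units (pfu) in a given
injection volume follows from simple division. The sketch below is an
arithmetic example under stated assumptions, not a dosing protocol; the
function name is hypothetical.

```python
def min_titer_pfu_per_ml(pfu, volume_ul):
    """Titer (pfu/ml) needed so that volume_ul microliters contains pfu."""
    if volume_ul <= 0:
        raise ValueError("volume must be positive")
    return pfu * 1000.0 / volume_ul  # 1000 microliters per milliliter


# Doses and volumes recited above: 1e7 or 1e8 pfu in 50 to 150 microliters.
for pfu in (1e7, 1e8):
    for volume_ul in (50, 150):
        titer = min_titer_pfu_per_ml(pfu, volume_ul)
        print(f"{pfu:.0e} pfu in {volume_ul} ul -> {titer:.2e} pfu/ml")
```

The smallest volume sets the most demanding titer: delivering the same dose
in 50 µl requires a stock three times as concentrated as delivering it in
150 µl.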

To prepare compositions, the viral particles are dialyzed into a suitable
ophthalmologically acceptable carrier or, for example, may be
concentrated and/or mixed therewith. The resulting mixture may be a solution,
suspension or emulsion. In addition, the viral particles may be formulated as
the sole pharmaceutically active ingredient in the composition or may be
combined with other active agents for the particular disorder treated.

For administration by intraocular injection or via eyedrops, suitable
carriers include, but are not limited to, physiological saline, phosphate buffered
saline (PBS), balanced salt solution (BSS), lactated Ringer's solution, and
solutions containing thickening and solubilizing agents, such as glucose,
polyethylene glycol, and polypropylene glycol and mixtures thereof. Liposomal
suspensions may also be suitable as pharmaceutically acceptable carriers. These
may be prepared according to methods known to those skilled in the art. Suitable
ophthalmologically acceptable carriers are known. Solutions or mixtures
intended for ophthalmic use may be formulated as 0.01% - 10% isotonic
solutions, pH about 5-7, with appropriate salts [see, e.g., U.S. Patent No.
5,116,868, which describes typical compositions of ophthalmic irrigation
solutions and solutions for local application]. Such solutions, which have a pH
adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4-6
mM dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-12 mM
sodium citrate, 0.5-1.5 mM magnesium chloride, 1.5-2.5 mM calcium chloride,
15-25 mM sodium acetate, 10-20 mM D,L-sodium β-hydroxybutyrate and 5-5.5
mM glucose.
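
The concentration ranges recited above lend themselves to a simple range
check. The following is a minimal sketch, assuming the ranges are read as
inclusive millimolar bounds; the dictionary keys, the function name and the
midpoint recipe are hypothetical and serve only to illustrate the check.

```python
# Concentration ranges (mM) recited above for an ophthalmic solution.
# The hydroxybutyrate entry is read here as the beta isomer (assumption).
RANGES_MM = {
    "sodium chloride": (90, 100),
    "dibasic potassium phosphate": (4, 6),
    "dibasic sodium phosphate": (4, 6),
    "sodium citrate": (8, 12),
    "magnesium chloride": (0.5, 1.5),
    "calcium chloride": (1.5, 2.5),
    "sodium acetate": (15, 25),
    "sodium beta-hydroxybutyrate": (10, 20),
    "glucose": (5, 5.5),
}


def out_of_range(recipe_mm):
    """Return components that are missing or outside the recited range."""
    problems = {}
    for name, (low, high) in RANGES_MM.items():
        value = recipe_mm.get(name)
        if value is None or not (low <= value <= high):
            problems[name] = value
    return problems


# Hypothetical recipe using the midpoint of each recited range.
midpoint = {name: (low + high) / 2 for name, (low, high) in RANGES_MM.items()}
print(out_of_range(midpoint))  # an empty dict means every component is in range
```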
The compositions may be prepared with carriers that protect them from
rapid elimination from the body, such as time release formulations or coatings.
Such carriers include controlled release formulations, such as, but not limited to,
microencapsulated delivery systems, and biodegradable, biocompatible polymers,
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid,
polyorthoesters, polylactic acid and other types of implants that may be placed
directly into the anterior or posterior chamber or vitreous cavity of the eye. The
compositions may also be administered in pellets, such as Elvax pellets
(ethylene-vinyl acetate copolymer resin).

Liposomal suspensions, including tissue-targeted liposomes, may also be
suitable as pharmaceutically acceptable carriers. For example, liposome
formulations may be prepared by methods known to those of skill in the art [see,
Kimm et al. (1983) Bioch. Bioph. Acta 728:339-398; Assil et al. (1987)
Arch. Ophthalmol. 105:400; and U.S. Patent No. 4,522,811]. The viral particles
may be encapsulated into the aqueous phase of liposome systems.

The active materials can also be mixed with other active materials that
do not impair the desired action, or with materials that supplement the desired
action or have other action, including viscoelastic materials, such as hyaluronic
acid, which is sold under the trademark HEALON, which is a solution of a high
molecular weight (MW of about 3 million) fraction of sodium hyaluronate
[manufactured by Pharmacia, Inc.; see, e.g., U.S. Patent Nos. 5,292,362,
5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803], VISCOAT
[fluorine-containing (meth)acrylates, such as
1H,1H,2H,2H-heptadecafluorodecylmethacrylate; see, e.g., U.S. Patent Nos.
5,278,126, 5,273,751 and 5,214,080; commercially available from Alcon
Surgical, Inc.], ORCOLON [see, e.g., U.S. Patent No. 5,273,056; commercially
available from Optical Radiation Corporation], methylcellulose, methyl
hyaluronate, polyacrylamide and polymethacrylamide [see, e.g., U.S. Patent
No. 5,273,751]. The viscoelastic materials are present generally in amounts
ranging from about 0.5 to 5.0%, preferably 1 to 3% by weight of the conjugate
material and serve to coat and protect the treated tissues. The compositions may
also include a dye, such as methylene blue or other inert dye, so that the
composition can be seen when injected into the eye. Additional active agents
may be included.

The compositions can be enclosed in ampules, disposable syringes or 
multiple or single dose vials made of glass, plastic or other suitable material. 

Such enclosed compositions can be provided in kits. In particular, kits
containing vials, ampules or other containers, preferably disposable vials with
sufficient amount of the composition to deliver about 0.100 ml thereof, and
disposable needles, preferably self sealing 25-30 gauge needles, are provided
herein.

Finally, the compounds may be packaged as articles of manufacture
containing packaging material, typically a vial, an ophthalmologically acceptable
composition containing the viral particles and a label that indicates the
therapeutic use of the composition.

Also provided are kits for practice of the methods herein. The kits
contain one or more containers, such as sealed vials, with sufficient composition
for single dosage administration, and one or more needles, such as self sealing
25-33 gauge needles, preferably 33 gauge or smaller needles, precisely
calibrated syringes or other precisely calibrated delivery device, suitable for
intravitreal injection.

Administration of the composition is preferably by intraocular injection,
although other modes of administration may be effective, if a sufficient
amount of the compound achieves contact with the vitreous cavity. Intraocular
injection may be effected by intravitreal injection, aqueous humor injection or
injection into the external layers of the eye, such as subconjunctival injection or
subtenon injection, or by topical application to the cornea, if a penetrating
formulation is used.

Administration 

The compositions containing the compounds are administered
intraocularly or by other means, such as topically in the form of penetrating
eyedrops, whereby contact of the recombinant vectors with the aqueous humor
is effected. Intraocular administration may be effected by intravitreal injection,
aqueous humor injection, injection into the external layers of the eye, such as
subconjunctival injection or subtenon injection, preferably in free form, but,
alternatively, in liposomes or other sustained drug delivery device.

Administration is preferably by intravitreal injection, preferably through self
sealing 25-30 gauge needles or other suitably calibrated delivery device.
Injection into the eye may be through the pars plana via the self-sealing needle.

It is further understood that, for any particular subject, specific dosage
regimens should be adjusted over time according to the individual need and the
professional judgment of the person administering or supervising the
administration of the recombinant viruses, and that the concentration ranges set
forth herein are exemplary only and are not intended to limit the scope or
practice of the claimed methods.

F. Diseases, Disorders and therapeutic products 
1. Diseases and disorders

Retinitis pigmentosa 

Methods for specifically or selectively targeting recombinant adenovirus
vectors for delivery of gene products, particularly therapeutic products, are
provided herein. These methods are particularly suitable for targeting cells that
express receptors that are selectively recognized by Ad viruses of subgroup D,
particularly Ad37. It is shown herein that these viruses selectively
recognize receptors on cells, such as conjunctival cells and photoreceptors, that
are not recognized by other adenoviruses. Hence, methods for targeting to
these cell types by providing vectors that are packaged in viral particles that
contain a sufficient portion of a fiber protein from one of these Ad serotypes to
bind to these receptors are provided. These methods are useful for targeting to
photoreceptors and for treating ocular disorders, including, but not limited to,
inherited and acquired retinal, neovascular degenerative diseases (see table
below).

It is estimated that 1 in 3,500 individuals in the United States suffers from
one of the pigmented retinopathies. This group of retinal diseases, commonly
called retinitis pigmentosa (RP), is characterized by progressive loss of
peripheral and night vision. Patients may be affected at almost any age and it is
not uncommon to experience symptoms in early childhood in certain inherited
forms. It has been shown that there are a variety of mutations in genes expressed
in the photoreceptors, including the rhodopsin gene and genes in the rhodopsin
pathway, that appear to be responsible for these diseases. In addition to
mutations in rhodopsin, the retinal pigmented epithelial (RPE) cells also undergo
degenerative changes and can form clumps of pigment that give rise to the
characteristic pigmentary changes seen in patients with RP.

Angiogenesis and ocular diseases and disorders 
The vast majority of diseases that cause catastrophic loss of vision do so
as a result of ocular neovascularization; age related macular degeneration
(ARMD) affects 12-15 million Americans over the age of 65 and causes visual
loss in 10-15% of them as a direct effect of choroidal (sub-retinal)
neovascularization. The leading cause of visual loss for Americans under the
age of 65 is diabetes; 16 million individuals in the United States are diabetic and
40,000 per year suffer from ocular complications of the disease, which often are
a result of retinal neovascularization. Laser photocoagulation has been effective
in preventing severe visual loss in subgroups of high risk diabetic patients, but
the overall 10 year incidence of retinopathy remains essentially unchanged. For
patients with choroidal neovascularization due to ARMD or inflammatory eye
disease, such as ocular histoplasmosis, photocoagulation, with few exceptions,
is ineffective in preventing visual loss. While recently developed,
non-destructive photodynamic therapies hold promise for temporarily reducing
individual loss in patients with previously untreatable choroidal
neovascularization, only 61.4% of patients treated every 3-4 months had
improved or stabilized vision compared to 45.9% of the placebo-treated group.

In the normal adult, angiogenesis is tightly regulated and limited to wound
healing, pregnancy and uterine cycling. Angiogenesis is turned on by specific
angiogenic molecules such as basic and acidic fibroblast growth factor (FGF),
vascular endothelial growth factor (VEGF), angiogenin, transforming growth
factor (TGF), tumor necrosis factor-α (TNF-α) and platelet derived growth factor
(PDGF). Angiogenesis can be suppressed by inhibitory molecules such as
interferon-α, thrombospondin-1, angiostatin and endostatin. It is the balance of
these naturally occurring stimulators and inhibitors that controls the normally
quiescent capillary vasculature. When this balance is upset, as in certain disease
states, capillary endothelial cells are induced to proliferate, migrate and
ultimately differentiate.

Angiogenesis plays a central role in a variety of diseases, including, but
not limited to, cancer and ocular neovascularization. Sustained growth and
metastasis of a variety of tumors has also been shown to be dependent on the
growth of new host blood vessels into the tumor in response to tumor derived
angiogenic factors. Proliferation of new blood vessels in response to a variety of
stimuli occurs as the dominant finding in the majority of eye diseases that blind,
such as, but not limited to, proliferative diabetic retinopathy (PDR), ARMD,
rubeotic glaucoma, interstitial keratitis and retinopathy of prematurity. In these
diseases, tissue damage can stimulate release of angiogenic factors resulting in
capillary proliferation. VEGF plays a dominant role in iris neovascularization and
neovascular retinopathies. While reports clearly show a correlation between
intraocular VEGF levels and ischemic retinopathic ocular neovascularization, FGF
likely also plays a role. Basic and acidic FGF are known to be present in the
normal adult retina, even though detectable levels are not consistently correlated
with neovascularization. This may be largely due to the fact that FGF binds very
tightly to charged components of the extracellular matrix and may not be readily
available in a freely diffusible form that would be detected by standard assays of
intraocular fluids.

A final common pathway in the angiogenic response involves
integrin-mediated information exchange between a proliferating vascular
endothelial cell and the extracellular matrix. This class of adhesion receptors,
called integrins, are expressed as heterodimers having an α and β subunit on all
cells. One such integrin, αvβ3, is the most promiscuous member of this family
and allows endothelial cells to interact with a wide variety of extracellular matrix
components. Peptide and antibody antagonists of this integrin inhibit
angiogenesis by selectively inducing apoptosis of the proliferating vascular
endothelial cells. Two cytokine-dependent pathways of angiogenesis exist and
may be defined by their dependency on distinct vascular cell integrins, αvβ3 and
αvβ5. Specifically, basic FGF- and VEGF-induced angiogenesis depend on integrin
αvβ3 and αvβ5, respectively, since antibody antagonists of each integrin
selectively block one of these angiogenic pathways in the rabbit corneal and
chick chorioallantoic membrane (CAM) models. Peptide antagonists that block
all αv integrins inhibit FGF- and VEGF-stimulated angiogenesis. While normal
human ocular blood vessels do not display either integrin, αvβ3 and αvβ5
integrins are selectively displayed on blood vessels in tissues from patients with
active neovascular eye disease. While only αvβ3 was consistently observed in
tissue from patients with ARMD, αvβ3 and αvβ5 were present in tissues from
patients with PDR. Systemically administered peptide antagonists of integrins
blocked new blood vessel formation in a mouse model of retinal vasculogenesis.

In addition to adhesion events described above, cell migration through the
extracellular matrix also depends on proteolysis. Matrix metalloproteinases
(MMPs) are a family of zinc-requiring matrix-degrading enzymes that include the
collagenases, gelatinases and stromelysins, all of which have been implicated in
invasive cell behavior. Invasive cell processes such as tumor metastasis and
angiogenesis have been found to be associated with the expression of integrins
and MMP-2, which are all found throughout the eye where they may interact to
maintain a quiescent vasculature until the balance is upset, resulting in
pathological angiogenesis. A non-catalytic C-terminal hemopexin-like domain of
MMP-2 (PEX) can block cell surface collagenolytic activity and inhibit
angiogenesis in the CAM model by preventing localization of MMP-2 to the
surface of invasive cells through interaction with the integrin αvβ3.

Hence, anti-angiogenic agents have a role in treating retinal degeneration
to prevent the damaging effects of these trophic and growth factors.
Angiogenic agents also have a role in promoting desirable vascularization to
retard retinal degeneration by enhancing blood flow to cells.

Members of adenovirus subgroup D, Ad8, 19A, and 37, are infectious
agents that cause particularly severe cases of epidemic keratoconjunctivitis
(EKC) (Arnberg et al. (1998) Virology 227:239-244; Curtis et al. (1998)
J. Med. Microbiol. 47:91-94; Ritterband et al. (1998) Rev. Med. Virol. 8:187-201;
and Takeuchi et al. (1999) J. Clin. Microbiol. 37:3392-3394). There is no
effective treatment for this debilitating and contagious disease and EKC
continues to be a problem in ophthalmology clinics worldwide (Curtis et al.
(1998) J. Med. Microbiol. 47:91-94; Lukashok et al. (1998)
Curr. Clin. Top. Infect. Dis. 18:286-304). Hence the vectors herein may be used
for treating the disease.

Table 3

CANDIDATE TARGETS FOR OCULAR DISEASE THERAPY

Disease                 Candidate target(s)

Retinitis pigmentosa    Rhodopsin gene, and genes that regulate expression
                        thereof; rds/peripherin
Stargardt's disease     rim protein (ABC protein)
Choroideremia           rab geranylgeranyl transferase; CHM, TCD, CHML*
Gyrate Atrophy          ornithine aminotransferase
Macular dystrophy       rds/peripherin

3: 193-4 (1993)

Other Diseases

Exudative Choroidal Diseases
ICSC, fluorescein angiogram
ICSC with large serous detachment of RPE (retinal pigmented epithelium)
ICSC with bullous retinal detachment
Macular drusen, exudative, confluent
Drusen, sub-RPE choroidal neovascularization
Drusen, notched serous detachment of RPE
Drusen, notched serous and hemorrhagic detachment of RPE
Drusen, serous and hemorrhagic detachment of RPE and retina
Drusen, organized RPE detachment causing bullous retinal detachment
Drusen, geographic atrophy of RPE
Drusen, exudative and cuticular, vitelliform macular detachment
Drusen, cuticular, large vitelliform macular detachment
North Carolina dystrophy with macular staphyloma
North Carolina dystrophy with macular staphyloma
Angioid streaks, pseudoxanthoma elasticum (PXE), CNVM
Angioid streaks, PXE, large notched retinal detachment
Myopic degeneration, Foerster-Fuchs spot
Presumed ocular histoplasmosis syndrome (POHS)
Submacular bacterial abscess
Toxocara canis, subretinal granuloma
Serpiginous (geographic) choroiditis
Posterior scleritis
Harada's disease
Posterior sympathetic uveitis
Benign reactive lymphoid hyperplasia of uveal tract
Choroidal ruptures and CNVM
Cavernous hemangioma of choroid
Choroidal osteoma
Choroidal nevus, serous macular detachment
Choroidal nevus with CNVM
Diffuse sclerochoroidal melanocytic nevus
Choroidal melanoma with serous detachment of RPE
Metastatic lung carcinoma to choroid
Sub-RPE reticulum cell sarcoma
RPE tear, idiopathic choroidal neovascularization

Heredodystrophic Disorders Affecting RPE & Retina
Best's vitelliform macular dystrophy
Best's vitelliform macular dystrophy with CNVM
Best's vitelliform macular dystrophy, multiple lesions
Adult-onset vitelliform foveomacular dystrophy
Pattern dystrophy simulating fundus flavimaculatus
Stargardt's disease (fundus flavimaculatus)
Asteroid macular dystrophy
Sjogren-Larsson syndrome
Oguchi's disease, light-adapted state
Oguchi's disease, dark-adapted state
Fundus albipunctatus
Retinitis pigmentosa, cystoid macular edema
Crystalline tapetoretinal dystrophy
Choroideremia
Goldmann-Favre syndrome
Sex-linked juvenile retinoschisis
Perivenous retinitis pigmentosa

Retinal Vascular Disorders
Retinal arteriovenous aneurysm
Central retinal artery occlusion
Cilioretinal artery obstruction
Ischemic retinopathy in systemic lupus erythematosus
Ischemic retinopathy in scleroderma
Hemorrhagic detachment of internal limiting membrane, hypertensive retinopathy
Acquired retinal arterial macroaneurysm
Cystoid macular edema, aphakic
Cystoid macular edema, nicotinic acid maculopathy
Congenital retinal telangiectasis
Acquired bilateral juxtafoveal telangiectasis
Acquired bilateral juxtafoveal obliterative telangiectasis
Diabetic optic neuropathy
X-ray radiation exudative retinopathy
Sickle cell SC disease, macular hemorrhage
Retinal arterial aneurysms, arteritis, neuroretinitis
Branch retinal vein obstruction (BRVO)
BRVO, exudative maculopathy
BRVO, optic disc new vessels, photocoagulation
Waldenstrom's macroglobulinemia

Inflammatory Diseases of the Retina and Choroid
Luetic retinal vasculitis
Focal Candida retinal abscess
Toxoplasmosis, atrophic chorioretinal scar
Toxoplasmosis retinitis and macular detachment
Toxoplasmosis scar, CNVM, macular detachment
Diffuse unilateral subacute neuroretinitis, small worm
Diffuse unilateral subacute neuroretinitis, large worm
Cytomegalic inclusion disease, papillitis
Acute posterior multifocal placoid pigment epitheliopathy
Acute macular neuroretinitis
Sarcoid retinitis
Sarcoid papillitis
Behcet's disease
Vitiliginous (bird-shot) chorioretinitis
Multifocal choroiditis and panuveitis (pseudo-POHS)

Retinal and Pigment Epithelial Hamartomas
Congenital grouped albinotic RPE spots
Congenital hyperplasia of RPE
Combined RPE and retinal hamartoma, juxtapapillary
Combined RPE and retinal hamartoma, peripheral
Cystic astrocytoma, juxtapapillary
Astrocytoma, macula
Astrocytoma, juxtapapillary
Cavernous hemangioma of retina
Juxtapapillary sessile retinal capillary hemangioma
Juxtapapillary endophytic retinal capillary hemangioma

Other Tumors of the Choroid
Choroidal metastasis
Choroidal osteoma
Choroidal hemangioma
Miscellaneous uveal tumors

Intraocular Lymphoid Tumors
The leukemias and lymphomas

Tumors of the Vitreous
Non-Hodgkins ("reticulum cell") lymphoma
Tumor involvement of the vitreous cavity

Macular Disease
Age-related macular degeneration -- atrophic form
Exudative age-related macular degeneration
Choroidal neovascular membrane in degenerative myopia
Central serous retinopathy
Endogenous fungal infections of the retina and choroid
Pars planitis
Syphilis and tuberculosis
Diffuse unilateral subacute neuroretinitis
Scleritis
Birdshot retinochoroidopathy
Punctate inner choroidopathy
Sarcoidosis
Acute multifocal placoid pigment epitheliopathy
Geographic helicoid peripapillary choroidopathy (GHPC): serpiginous choroiditis
Sympathetic ophthalmia
Vogt-Koyanagi-Harada syndrome (uveomeningitic syndrome)
Ciliochoroidal (uveal) effusion

Reproduced from: Stereoscopic Atlas of Ocular Diseases Diagnosis and Treatment,
... C.V. Mosby Co. (1987); ... (1989).

2. Therapeutic products

Therapeutic products include, but are not limited to, wild-type genes that
are defective in ocular disorders, such as rhodopsin, or fragments thereof
sufficient to correct the genetic defect, trophic factors, including growth factors,
inhibitors and agonists of trophic factors, anti-apoptosis factors and other
products described herein or known to those of skill in the art to be useful for
treatment of disorders of the eye or that can be treated by a product expressed
by a photoreceptor.



WO 01/83729    PCT/EP01/04863

OCULAR GENE THERAPY STRATEGIES

GENERAL DISEASE          EXAMPLES                       STRATEGY

Hereditary retinal and   Retinitis pigmentosa           Growth factors (e.g., GDNF)
macular degeneration     Stargardt's disease            Anti-apoptotic factors (e.g., bcl2 gene)
                         Other macular dystrophies      Stargardt Disease Gene (ABCR)

Neovascular              Diabetes                       Anti-angiogenesis factors
                         Choroidal neovascularization

Anti-tumor               Retinoblastoma                 Antiproliferant

Glaucoma                 Nerve fiber layer atrophy      Neuroprotective agent

For treatment of retinitis pigmentosa, the vector
can deliver a wild-type rhodopsin gene or a growth factor or trophic factor, such
as ciliary neurotrophic factor (CNTF); for treatment of Stargardt's disease, the
vector can deliver a wild-type ABCR (also called STGD1) or a growth factor or
anti-angiogenic agent; for diabetic retinopathies and retinal vascularization, the
vector can deliver growth factors, such as a TGF (TGFβ), to prevent
degeneration.

The following examples are included for illustrative purposes only and are
not intended to limit the scope of the invention.

EXAMPLE 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as
host cells for the preparation of adenovirus packaging cell lines. Preferred cells
include 293 cells, an adenovirus-transformed human embryonic kidney cell line
obtained from the ATCC, having Accession Number CRL 1573; HeLa, a human
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human
lung carcinoma cell line (ATCC Accession Number CCL 1889); and other
epithelial-derived cell lines. As a result of the adenovirus transformation, the
293 cells contain the E1 early region regulatory gene. All cells were maintained
in complete DMEM + 10% fetal calf serum unless otherwise noted.

These cell lines allow the production and propagation of adenovirus-based
gene delivery vectors that have deletions in preselected gene regions and that
are obtained by cellular complementation of adenoviral genes. To provide the
desired complementation of such deleted adenoviral genomes in order to
generate a viral vector, plasmid vectors that contain preselected functional units
have been designed. Such units include, but are not limited to, the E1 early
region, E4 and the viral fiber gene. The preparation of plasmids that provide
such complementation, and are therefore "complementary plasmids or
constructs," and that are stably inserted into host cell chromosomes is
described below.

A. Preparation of an E4-Expressing Plasmid for Complementation of
E4-Gene-Deleted Adenoviruses

The viral E4 regulatory region contains a single transcription unit that is
alternatively spliced to produce several different mRNA products. The
E4-expressing plasmid prepared as described herein and used to transfect the
293 cell line contains the entire E4 transcription unit. A DNA fragment
extending from 175 nucleotides upstream of the E4 transcription start site,
including the natural E4 promoter, to 153 nucleotides downstream of the E4
polyadenylation signal, including the natural E4 terminator signal, corresponding
to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as
Ad5) genome as described in Chroboczek et al. (Virol., 186:280-285 (1992),
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA,
obtained from the ATCC, via the polymerase chain reaction (PCR). Sequences of
the primers used were 5'CGGTACACAGAATTCAGGAGACACAACTCC3'
(forward or 5' primer referred to as E4L) (SEQ ID NO: 1) and
5'GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC3' (SEQ ID NO: 2)
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR
fragment, these oligonucleotides were designed to create new sites for the
restriction enzymes EcoRI and BamHI, respectively (GAATTC in E4L and GGATCC
in E4R). DNA was amplified via PCR using 30 cycles of 92°C for 1 minute,
50°C for 1 minute, and 72°C for 3 minutes, resulting in amplified full-length E4
gene products.
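
As a quick consistency check on the primer design described above, the following Python sketch confirms that each primer carries the recognition site it is said to introduce and that the quoted Ad5 coordinates span a fragment of the expected length. It is illustrative only: the recognition sequences are the standard ones for EcoRI and BamHI, the E4L sequence is written with its EcoRI site read as GAATTC (an assumption about the intended sequence), and the helper names are hypothetical.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}

# Primer sequences (SEQ ID NO: 1 and 2); E4L is written with the EcoRI
# site spelled out, which is an assumption about the intended sequence.
E4L = "CGGTACACAGAATTCAGGAGACACAACTCC"
E4R = "GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC"


def sites_in(primer):
    """Return the names of the enzymes whose site occurs in the primer."""
    return [name for name, site in SITES.items() if site in primer]


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


# Ad5 nucleotides 32667-35780, the E4 region amplified in the text.
e4_region_bp = span_length(32667, 35780)

print("E4L sites:", sites_in(E4L))
print("E4R sites:", sites_in(E4R))
print("Ad5-derived E4 region (bp):", e4_region_bp)
```

The length computed here covers only the Ad5-derived region; any non-templated primer tail would add to the size of the actual PCR product.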

The amplified E4 DNA products were then digested with EcoRI and
BamHI for cloning into the compatible sites of pBluescript/SK+ by standard
techniques to create the plasmid pBS/E4. A 2603 base pair (bp) cassette
including the herpes simplex virus thymidine kinase promoter, the hygromycin
resistance gene, and the thymidine kinase polyadenylation signal was excised
from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with FspI
followed by addition of BamHI linkers (5'CGCGGATCCGCG3') (SEQ ID NO: 3)
for subsequent digestion with BamHI to isolate the hygromycin-containing
fragment.

The isolated BamHI-modified fragment was then cloned into the BamHI
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro
containing 8710 bp. The pE4/Hygro plasmid has been deposited with the ATCC
under accession number 97739. The complete nucleotide sequence of
pE4/Hygro is set forth in SEQ ID NO: 4. Position number 1 of the linearized
vector corresponds to approximately the middle portion of the pBS/SK+
backbone. The 5' and 3' ends of the E4 gene are located at respective
nucleotide positions 3820 and 707 of SEQ ID NO: 4 while the 5' and 3' ends of
the hygromycin insert are located at respective nucleotide positions 3830 and
6470. In the clone that was selected for use, the E4 and hygromycin resistance
genes were divergently transcribed.

B. Preparation of a Fiber-Expressing Plasmid for Complementation of 
Fiber-Gene-Deleted Adenoviruses 

To prepare a fiber-encoding construct, primers were designed to amplify
the fiber coding region from Ad5 genomic DNA with the addition of unique
BamHI and NotI sites at the 5' and 3' ends of the fragment, respectively. The
Ad5 nucleotide sequence is available with the GenBank Accession Number
M18369. The 5' and 3' primers had the respective nucleotide sequences of
5'ATGGGATCCAAGATGAAGCGCGCAAGACCG3' (SEQ ID NO: 5) and
5'CATAACGCGGCCGCTTCTTTATTCTTGGGC3' (SEQ ID NO: 6), where the
inserted BamHI and NotI sites are GGATCC and GCGGCCGC, respectively. The
5' primer also contained a nucleotide substitution 3 nucleotides 5' of the
second ATG codon (C to A) that is the initiation site. The nucleotide
substitution was included so as to improve the consensus for initiation of fiber
protein translation.

The amplified DNA fragment was inserted into the BamHI and NotI sites of
pCDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having
7148 bp. The parent plasmid contained the CMV promoter, the bovine growth
hormone (BGH) terminator and the gene for conferring neomycin resistance. The
viral sequence included in this construct corresponds to nucleotides
31040-32791 of the Ad5 genome.

The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID
NO: 7 where the nucleotide position 1 corresponds to approximately the middle
of the pCDNA3 vector sequence. The 5' and 3' ends of the fiber gene are
located at respective nucleotide positions 916 with ATG and 2661 with TAA.

To enhance expression of fiber protein by the constitutive CMV promoter
provided by the pcDNA vector, a BglII fragment containing the tripartite leader
(TPL) of adenovirus type 5 was excised from pRD112a (Sheay et al.,
BioTechniques, 15:856-862 (1993)) and inserted into the BamHI site of
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp. The adenovirus
tripartite leader sequence, present at the 5' end of all major late adenoviral
mRNAs as described by Logan et al., Proc. Natl. Acad. Sci., USA, 81:3655-
3659 (1984) and Berkner, BioTechniques, 6:616-629 (1988), also referred to as
a "partial TPL" since it contains a partial exon 1, shows correspondence with the
Ad5 leader sequence having three spatially separated exons corresponding to
nucleotide positions 6081-6089 (the 3' end of the first leader segment), 7111-
7182 (the entire second leader segment), and 9644-9845 (the third leader
segment and sequence downstream of that segment). The corresponding cDNA
sequence of the partial tripartite leader sequence present in pCLF is included in
SEQ ID NO: 8 bordered by BamHI/BglII 5' and 3' sites at respective nucleotide
positions 907-912 to 1228-1233. The nucleotide sequence of an isolated partial
TPL is also listed separately as SEQ ID NO: 22 with the noted 5' and 3'
restriction sites and with the following nucleotide regions identified: 5' BglII
site; 1-18 nt polylinker; 19-27 nt last 9 nt of the first leader segment (exon 1);
28-99 nt second leader segment (exon 2); 100-187 nt third leader segment
(exon 3); 188-301 nt contains the nt sequence immediately following the third
leader in the genome with an unknown function; and 322-327 nt BglII site.
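
The coordinates quoted above for the partial tripartite leader can be cross-checked with simple arithmetic. The Python sketch below is an illustration, not part of the described method; it assumes inclusive 1-based coordinates, reads the plasmid sizes as 7148 bp (pCDNA3/Fiber) and 7469 bp (pCLF), and uses hypothetical names throughout.

```python
def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


# Ad5 genome coordinates of the leader segments, as quoted in the text.
ad5 = {
    "exon1_3prime": (6081, 6089),
    "exon2": (7111, 7182),
    "exon3_plus_downstream": (9644, 9845),
}

# Corresponding regions of the isolated partial TPL (SEQ ID NO: 22).
tpl = {
    "exon1_3prime": (19, 27),
    "exon2": (28, 99),
    "exon3": (100, 187),
    "downstream": (188, 301),
}

ad5_len = {name: span_length(*rng) for name, rng in ad5.items()}
tpl_len = {name: span_length(*rng) for name, rng in tpl.items()}

# Size gained when the leader fragment is inserted into pCDNA3/Fiber
# (assumed sizes: 7469 bp for pCLF, 7148 bp for pCDNA3/Fiber).
size_gain = 7469 - 7148

# Leader cassette in pCLF, counted from the first nucleotide of the 5'
# site (907) to the last nucleotide of the 3' site (1233).
cassette = span_length(907, 1233)

print(ad5_len)
print(tpl_len)
print("size gain:", size_gain, "cassette:", cassette)
```

Under these assumptions the segment lengths agree (9, 72, and 88 + 114 = 202 nt), and the 327-nt cassette exceeds the 321-bp size gain by 6 nt, which is consistent with the single 6-nt BamHI site already present in pCDNA3/Fiber.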

The pCLF plasmid has been deposited with the ATCC as described in
Example 4. The complete nucleotide sequence of pCLF is listed in SEQ ID NO: 8
where the nucleotide position 1 corresponds to approximately the middle of the
pcDNA3 parent vector sequence. The 5' and 3' ends of the Ad5 fiber gene are
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982
with TAA.

C. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids
Encoding Functional E4 and Fiber Proteins

The 293 cell line was selected for preparing the first adenovirus
packaging line as it already contains the E1 gene as prepared by Graham et al.,
J. Gen. Virol., 36:59-74 (1977) and as further characterized by Spector, Virol.,
130:533-538 (1983). Before electroporation, 293 cells were grown in RPMI
medium + 10% fetal calf serum. Four x 10^6 cells were electroporated with 20
µg each of pE4/Hygro DNA and pCLF DNA using a BioRad GenePulser and
settings of 300 V, 25 µF. DNA for electroporation was prepared using the
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond,
CA).

Following electroporation, cells were split into fresh complete DMEM +
10% fetal calf serum containing 200 µg/ml Hygromycin B (Sigma, St. Louis,
MO).

From expanded colonies, genomic DNA was isolated using the
"MICROTURBOGEN" system (Invitrogen) according to manufacturer's
instructions. The presence of integrated E4 DNA was assessed by PCR using
the primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGAAGTCC3')
(SEQ ID NO: 9), the latter of which is a 5' forward primer near adenovirus 5
open reading frame 6.

One clone, designated 211, which exhibited altered growth properties
relative to the parent cell line 293, was selected. The 211 clone contained
the product, indicating the presence of inserted DNA corresponding to most, if
not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 211 cell
line has been deposited with the ATCC as described in Example 4. This line was
further evaluated by amplification using the primer pair E4L/E4R described
above, and a product corresponding to the full-length E4 insert was detected.
Genomic Southern blotting was performed on DNA restricted with EcoRI and
BamHI. The E4 fragment was then detected at approximately one copy/genome
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4
for use as a labeled probe with the Genius system according to manufacturer's
instructions (Boehringer Mannheim, Indianapolis, IN). In DNA from the 211 cell
line, the labeled internal fragment of pE4/Hygro hybridized with the isolated E4
sequences. In addition, the probe hybridized to a larger fragment which may be
the result of a second insertion event.

Although the 211 cell line was not selected by neomycin resistance, thus
indicating the absence of the fiber gene, the 211 cell line was analyzed for
expression of fiber protein by indirect immunofluorescence with an anti-fiber
polyclonal antibody and a FITC-labeled anti-rabbit IgG (KPL) as secondary
antibody in order to confirm the lack of the fiber gene. No immunoreactivity
was detected. Therefore, to generate 211 clones containing recombinant fiber
genes, the 211 clone was expanded by growing in RPMI medium and subjected to
additional electroporation with the fiber-encoding pCLF plasmid as described
above.

Following electroporation, cells were plated in DMEM + 10% fetal calf
serum and colonies were selected with 200 µg/ml G418 (Gibco, Gaithersburg,
MD). Positive cell lines remained hygromycin resistant. These candidate
sublines of 211 were then screened for fiber protein expression by indirect
immunofluorescence as described above. The three sublines screened, 211A,
211B and 211R, along with a number of other sublines, all exhibited nuclear
staining qualitatively comparable to the positive control of 293 cells infected
with AdRSVβgal (1 pfu/cell) and stained 24 hours post-infection.

Lines positive for nuclear staining in this assay were then subjected to
Western blot analysis under denaturing conditions using the same antibody.
Several lines in which the antibody detected a protein of the predicted molecular
weight (62 kd for the Ad5 fiber protein) were selected for further study, including
211A, 211B and 211R. The 211A cell line has been deposited with ATCC as
described in Example 4.

Immunoprecipitation analysis using soluble nuclear extracts from these
three cell lines and a seminative electrophoresis system demonstrated that the
fiber protein expressed is in the functional trimeric form characteristic of the
native fiber protein. The predicted molecular weight of a trimerized fiber is 186
kd. Under denaturing conditions, the trimeric form was destroyed, resulting in
detectable fiber monomers. Those clones containing endogenous E1 and newly
expressed recombinant E4 and fiber proteins were selected for use in
complementing adenovirus gene delivery vectors having the corresponding
adenoviral genes deleted as described in Example 2.

D. Preparation of an E1-Expressing Plasmid for Complementation of
E1-Gene-Deleted Adenoviruses

In order to prepare adenoviral packaging cell lines other than those based
on the E1-gene containing 293 cell line as described in Example 1C above,
plasmid vectors containing E1 alone or in various combinations with E4 and fiber
genes are constructed as described below.

The region of the adenovirus genome containing the E1a and E1b genes is
amplified from viral genomic DNA by PCR as previously described. The primers
used are E1L, the 5' or forward primer, and E1R, the 3' or backward primer,
having the respective nucleotide sequences
5'CCGAGCTAGCGACTGAAAATGAG3' (SEQ ID NO: 10) and
5'CCTCTCGAGAGACAGCAAGACAC3' (SEQ ID NO: 11). The E1L and E1R primers
include the respective restriction sites NheI (GCTAGC) and XhoI (CTCGAG). The
sites are used to clone the amplified E1 gene fragment into the NheI/XhoI sites
in pMAM, commercially available from Clontech (Palo Alto, CA), to form the
plasmid pDEX/E1 having 11152 bp.

The complete nucleotide sequence of pDEX/E1 is listed in SEQ ID NO: 12
where the nucleotide position 1 corresponds to approximately 1454 nucleotides
from the 3' end of the pMAM backbone vector sequence. The pDEX/E1 plasmid
includes nucleotides 552 to 4090 of the adenovirus genome positioned
downstream (beginning at nucleotide position 1460 and ending at 4998 in the
pDEX/E1 plasmid) of the glucocorticoid-inducible mouse mammary tumor virus
(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene
that allows stable transfectants to be isolated using
hypoxanthine/aminopterin/thymidine (HAT) selection. The pMAM backbone
occupies nucleotide positions 1-1454 and 5005-11152 of SEQ ID NO: 12.
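
The relationship between adenovirus genome coordinates and pDEX/E1 coordinates described above amounts to a fixed offset. The Python sketch below illustrates this under the assumption that all coordinates are inclusive and 1-based and that the backbone ranges read 1-1454 and 5005-11152; the function and constant names are hypothetical.

```python
PLASMID_SIZE = 11152                    # pDEX/E1 size stated in the text
AD_START, AD_END = 552, 4090            # adenovirus genome coordinates
INSERT_START, INSERT_END = 1460, 4998   # same fragment within pDEX/E1
BACKBONE = [(1, 1454), (5005, 11152)]   # pMAM backbone regions (assumed reading)

# Constant shift between genome numbering and plasmid numbering.
OFFSET = INSERT_START - AD_START


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def ad_to_plasmid(pos):
    """Map an adenovirus genome position in the E1 fragment to pDEX/E1."""
    if not AD_START <= pos <= AD_END:
        raise ValueError(f"position {pos} lies outside the cloned E1 fragment")
    return pos + OFFSET


backbone_bp = sum(span_length(s, e) for s, e in BACKBONE)
insert_bp = span_length(INSERT_START, INSERT_END)

# Nucleotides belonging to neither backbone nor E1 insert (cloning junctions).
junction_bp = PLASMID_SIZE - backbone_bp - insert_bp

print("offset:", OFFSET)
print("insert bp:", insert_bp, "backbone bp:", backbone_bp, "junction bp:", junction_bp)
```

With these readings the insert is the same length in both numbering systems, and a small number of junction nucleotides at the cloning sites accounts for the remainder of the plasmid.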

E. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids
Encoding Functional E1 and Fiber Proteins

To create separate adenovirus packaging cell lines equivalent to that of
the 211 sublines, 211A, 211B and 211R, as described in Example 1C,
alternative cell lines lacking adenoviral genomes are selected for transfection
with the plasmid constructs as described below. Acceptable host cells include
A549, HeLa, Vero and the like cell lines as described in Example 1. The selected
cell line is transfected with the separate plasmids, pDEX/E1 and pCLF,
for expressing E1 and fiber complementary proteins, respectively. Following
transfection procedures as previously described, clones containing stable
insertions of the two plasmids are isolated by selection with neomycin and HAT.
Integration of a full-length copy of the E1 gene is assessed by PCR amplification
from genomic DNA using the primer set E1L/E1R, as described above.
Functional insertion of the fiber gene is assayed by staining with the anti-fiber
antibody as previously described.

The resultant stably integrated cell line is then used as a packaging cell 
system to complement adenoviral gene delivery vectors having the 
corresponding adenoviral gene deletions as described in Example 2. 

F. Preparation of a Plasmid Containing Two or More Adenoviral Genes for
Complementing Gene-Deleted Adenoviruses

The methods described in the preceding Examples rely on the use of two
plasmids, pE4/Hygro and pCLF, or pCLF and pDEX/E1, for generating adenoviral
cell packaging systems. In alternative embodiments, complementing plasmids
containing two or more adenoviral genes for expression of encoded proteins in
various combinations are also prepared as described below. The resultant
plasmids are then used in various cell systems with delivery plasmids having the
corresponding adenoviral gene deletions. The selection of packaging cell,
content of the delivery plasmids and content of the complementing plasmids for
use in generating recombinant adenovirus viral vectors thus depends on whether
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if
so, which ones.
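
The dependence of plasmid choice on the host cell and on the genes deleted from the delivery vector can be pictured as a simple lookup. The following Python sketch is only an illustration of that logic, restricted to the single-gene complementing plasmids and host cells named in this Example; the data structure and function name are hypothetical and not part of the described methods.

```python
# Single-gene complementing plasmids described in this Example, with the
# adenoviral gene each supplies and the selection used for stable clones.
COMPLEMENTING_PLASMIDS = {
    "pE4/Hygro": {"genes": {"E4"}, "selection": "hygromycin B"},
    "pCLF": {"genes": {"fiber"}, "selection": "G418"},
    "pDEX/E1": {"genes": {"E1"}, "selection": "HAT"},
}

# Adenoviral genes already present in each host cell line (293 carries E1;
# the other lines are described as lacking adenoviral genomes).
HOST_SUPPLIES = {"293": {"E1"}, "A549": set(), "HeLa": set(), "Vero": set()}


def plasmids_needed(host, deleted_genes):
    """List the plasmids required to complement the deleted genes in a host."""
    missing = set(deleted_genes) - HOST_SUPPLIES[host]
    chosen = [
        name
        for name, info in COMPLEMENTING_PLASMIDS.items()
        if info["genes"] & missing
    ]
    covered = set().union(*(COMPLEMENTING_PLASMIDS[n]["genes"] for n in chosen))
    if covered != missing:
        raise ValueError(f"no plasmid listed for: {sorted(missing - covered)}")
    return chosen


for host, deleted in [("293", {"E4", "fiber"}), ("A549", {"E1", "fiber"})]:
    needed = plasmids_needed(host, deleted)
    selections = [COMPLEMENTING_PLASMIDS[n]["selection"] for n in needed]
    print(host, sorted(deleted), "->", needed, selections)
```

A two-gene complementing plasmid, such as those prepared below, would simply appear in the table with more than one gene in its `genes` set.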

1. Preparation of a Complementing Plasmid Containing Fiber
and E1 Adenoviral Genes

A DNA fragment containing sequences for the CMV promoter, adenovirus
tripartite leader, fiber gene and bovine growth hormone terminator is amplified
from pCLF prepared in Example 1B using the forward primer
5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), which anneals to
nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward
primer 5'CCGCCTCAGAAGCCATAGAGCC3' (SEQ ID NO: 14), which anneals to
nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is
amplified as previously described and then cloned into the pDEX/E1 plasmid,
prepared in Example 1D. For cloning in the DNA fragment, the pDEX/E1 vector
is first digested with NdeI, which cuts at a unique site in the pMAM vector
backbone in pDEX/E1, then the ends are repaired by treatment with
bacteriophage T4 polymerase and dNTPs.

The resulting plasmid containing E1 and fiber genes, designated
pE1/Fiber, provides dexamethasone-inducible E1 function as described for
pDEX/E1 and expression of Ad5 fiber protein as described above.

The complete nucleotide sequence of pE1/Fiber is listed in SEQ ID NO: 15
where the nucleotide position 1 corresponds to approximately 1459 nucleotides
from the 3' end of the parent vector pMAM sequence. The 5' and 3' ends of
the Ad5 E1 gene are located at respective nucleotide positions 1460 and 4998
followed by pMAM backbone and then separated from the Ad5 fiber from pCLF
by the filled-in blunt ended NdeI site. The 5' and 3' ends of the pCLF fiber gene
fragment are located at respective nucleotide positions 10922-14223 containing
elements as previously described for pCLF.

The resultant pE1/Fiber plasmid is then used to complement one or more
delivery plasmids expressing E1 and fiber.

The pE1/Fiber construct is then used to transfect a selected host cell as
described in Example 1E to generate stable chromosomal insertions performed as
previously described followed by selection on HAT medium. The stable cells are
then used as packaging cells as described in Example 2.

2. Preparation of a Complementing Plasmid Containing E4 and
Fiber Adenoviral Genes

Plasmid pCLF prepared as described in Example 1B is partially digested
with BglII to cut only at the site in the pCDNA3 backbone. The pE4/Hygro
plasmid prepared in Example 1A is digested with BamHI to produce a fragment
containing E4. The E4 fragment is then inserted into the BamHI site of pCLF to
form plasmid pE4/Fiber. The resultant plasmid provides expression of the fiber
gene as described for pCLF and E4 function as described for pE4/Hygro.

The pE4/Fiber plasmid has 10610 bp. The complete
nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 16 where the
nucleotide position 1 corresponds to approximately 14 bp from the 3' end of the
parent vector pCDNA3 backbone sequence. The 5' and 3' ends of the Ad5 E4
gene are located at respective nucleotide positions 21 and 3149 followed by
fused BglII/BamHI sites and pCDNA3 backbone including the CMV promoter
again followed by BglII/BamHI sites. The adenovirus leader sequence begins at
nucleotide position 4051 and extends to 4366 followed by fused BamHI/BglII
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide
positions 4372 and 6124.

Stable chromosomal insertions of pE4/Fiber in host cells are obtained as
described above.



EXAMPLE 2 



Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral Packaging Cell 
Lines 

Adenoviral delivery vectors are prepared to separately lack the
combinations of E1/fiber and E4/fiber. Such vectors are more replication-
defective than those previously in use due to the absence of multiple viral genes.
A preferred adenoviral delivery vector, which is replication competent but only
via a non-fiber means, is one that lacks only the fiber gene but contains the
remaining functional adenoviral regulatory and structural genes. Furthermore,
these adenovirus delivery vectors have a higher capacity for insertion of foreign
DNA.

A. Preparation of Adenoviral Gene Delivery Vectors Having Specific Gene
Deletions and Methods of Use

To construct the E1/fiber-deleted viral vector containing the LacZ
reporter gene construct, two new plasmids were constructed. The plasmid
pΔE1Bβgal was constructed as follows. A DNA fragment containing the SV40
regulatory sequences and E. coli β-galactosidase gene was isolated from pSVβgal
(Promega) by digesting with VspI, filling the overhanging ends by treatment with
Klenow fragment of DNA polymerase I in the presence of dNTPs and digesting
with BamHI. The resulting fragment was cloned into the EcoRV and BamHI
sites in the polylinker of pΔE1sp1B (Microbix Biosystems, Hamilton, Ontario) to
form pΔE1Bβgal, which therefore contained the left end of the adenovirus genome
with the E1a region replaced by the LacZ cassette (nucleotides 6690 to 4151) of
pSVβgal. Plasmid DNA may be prepared by the alkaline lysis method as
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or by the
Qiagen method according to the manufacturer's instruction, from transformed
cells used to expand the plasmid. The plasmid DNA was then purified by
CsCl-ethidium bromide density gradient centrifugation. Alternatively, plasmid
DNAs may be purified from E. coli by standard methods known in the art (e.g.,
see Sambrook et al.).

The second plasmid (pDV44), prepared as described herein, is derived
from pBHG10, a vector prepared as described by Bett et al., Proc. Natl. Acad.
Sci., USA, 91:8802-8806 (1994) (see also International PCT application No.
WO 95/00655) using methods well known to one of skill in the art. This vector
is also commercially available from Microbix and contains an Ad5 genome
with the packaging signals at the left end deleted and the E3 region
(nucleotides 28133:30818) replaced by a linker with a unique site for the
restriction enzyme PacI. An 11.9 kb BamHI fragment, which contains the right
end of the adenovirus genome, is isolated from pBHG10 and cloned into the
BamHI site of pBS/SK(+) to create plasmid p11.3 having approximately
14,658 bp. The p11.3 plasmid was then digested with PacI and SalI to remove
the fiber, E4, and inverted terminal repeat (ITR) sequences.

This fragment was replaced with a 3.4 kb fragment containing the ITR
segments and the E4 gene, which was generated by PCR amplification from
pBHG10 using the following oligonucleotide sequences:

5'TGTACACCGGATCCGGCGCACACC3' SEQ ID NO: 17; and
5'CACAACGAGCTCAATTAATTAATTGCCACATCCTC3' SEQ ID NO: 18.

These primers incorporated sites for PacI and BamHI. Cloning this fragment into
the PacI and blunt-ended SalI sites of the p11.3 backbone resulted in a
substitution of the fused ITRs, E4 region and fiber gene present in pBHG10 by
the ITRs and E4 region alone. The resulting p11.3 plasmid containing the ITR
and E4 regions, designated plasmid pDV43a, was then digested with BamHI.
This BamHI fragment was then used to replace a BamHI fragment in pBHG10,
thereby creating pDV44 in a pBHG10 backbone.
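
The statement that the two primers carry BamHI and PacI sites can be checked
directly against the primer sequences. The following is a minimal sketch, not
part of the original procedure. It assumes the primer sequences as transcribed
in this text with the spacing in the printed sequences removed, and uses the
standard recognition sequences GGATCC (BamHI) and TTAATTAA (PacI). The helper
name find_sites is hypothetical.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "PacI": "TTAATTAA"}

# Primers as transcribed in the text (SEQ ID NO: 17 and 18), spaces removed.
# Assumption: the transcription is complete and free of OCR errors.
PRIMERS = {
    "SEQ ID NO: 17": "TGTACACCGGATCCGGCGCACACC",
    "SEQ ID NO: 18": "CACAACGAGCTCAATTAATTAATTGCCACATCCTC",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: 0-based index of first match} for each site present in seq."""
    seq = seq.upper()
    return {name: seq.find(motif) for name, motif in sites.items() if motif in seq}


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt", find_sites(seq))
```

Under these assumptions, one primer carries the BamHI site and the other the
PacI site, which agrees with the description of the cloning step.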

In an alternative approach to preparing pDV44, with an additional
subcloning step to facilitate the incorporation of restriction cloning sites,
the following cloning procedure was performed. pDV44 as above was constructed
by removing the fiber gene and some of the residual E3 sequences from pBHG10
(Microbix Biosystems). As above, to simplify manipulations, the 11.9 kb BamHI
fragment including the rightmost part of the Ad5 genome was removed from
pBHG10 and inserted into pBS/SK. The resulting plasmid was termed p11.3.
The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of
adenovirus type 5 was amplified as described above from pBHG10 using the
oligonucleotides listed above and subcloned into the vector pCR2.1 (Invitrogen)
to create pDV42. This step is the additional cloning step to facilitate the
incorporation of a SalI restriction site. pDV42 was then digested with PacI,
which cuts at a unique site (bold type) in one of the PCR primers, and with
SalI, which cuts at a unique site in the pCR2.1 polylinker. This fragment was
used to replace the corresponding PacI/XhoI fragment of p11.3 (the pBS
polylinker adjacent to the Ad DNA fragment contains a unique XhoI site),
creating pDV43.

A plasmid designated pDV44 was constructed by replacing the 11.9 kb
BamHI fragment of pBHG10 by the analogous BamHI fragment of pDV43. As
generated in the first procedure, pDV44 therefore differs from pBHG10 by the
deletion of Ad5 nucleotides 30819:32743 (residual E3 sequences and all but the
3'-most 41 nucleotides of the fiber open reading frame).
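
The sizes implied by the quoted Ad5 coordinates can be worked out with simple
inclusive-range arithmetic. The sketch below is illustrative only. It assumes
the coordinate pairs are inclusive, as the "start:end" notation suggests, and
the function name span_length is hypothetical.

```python
def span_length(start, end):
    """Length in nucleotides of an inclusive coordinate range start:end."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates quoted in the text (Ad5 numbering).
E3_REPLACED_IN_PBHG10 = (28133, 30818)   # E3 region replaced by the PacI linker
EXTRA_DELETED_IN_PDV44 = (30819, 32743)  # residual E3 plus most of the fiber ORF

if __name__ == "__main__":
    e3 = span_length(*E3_REPLACED_IN_PBHG10)
    extra = span_length(*EXTRA_DELETED_IN_PDV44)
    # The second deletion begins immediately after the first one ends.
    contiguous = EXTRA_DELETED_IN_PDV44[0] == E3_REPLACED_IN_PBHG10[1] + 1
    print(e3, extra, e3 + extra, contiguous)
```

On these assumptions, pDV44 lacks 1,925 nucleotides relative to pBHG10, and
the combined E3 and fiber gap relative to the Ad5 coordinates is one
contiguous interval.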

Thus, to summarize, the cloning procedures described above result in the
production of a fiber-deleted Ad5 genomic plasmid (pDV44) that was
constructed by removing the fiber gene and some of the residual E3 sequences
from pBHG10. pDV44 contains a wild-type E4 region, but only the last 41
nucleotides of the fiber ORF (this sequence was retained to avoid affecting
expression of the adjacent E4 transcription unit). Plasmids pBHG10 and pDV44
contain unpackageable Ad5 genomes, and must be rescued by cotransfection
and subsequent homologous recombination with DNA carrying functional
packaging signals. In order to generate vectors marked with a reporter gene,
either pDV44 or pBHG10 was cotransfected with pΔE1Bβgal, which contains
the left end of the Ad5 genome with an SV40-driven β-galactosidase reporter
gene inserted in place of the E1 region.

In general, and as described below, the method for virus production by
recombination of plasmids followed by complementation in cell culture involves
the isolation of recombinant viruses by cotransfection of any one of the
adenovirus packaging cell systems prepared in Example 1, namely 211A, 211B,
211R, A549, Vero cells, and the like, with plasmids carrying sequences
corresponding to viral gene delivery vectors.

A selected cell line is plated in dishes and cotransfected with pDV44 and
pΔE1Bβgal using the calcium phosphate method as described by Bett et al.,
Proc. Natl. Acad. Sci., USA, 91:8802-8806 (1994). Recombination between the
overlapping adenovirus sequences in the two plasmids leads to the creation of a
full-length viral chromosome where pDV44 and pΔE1Bβgal recombine to form a
recombinant adenovirus vector having multiple deletions. The deletion of E1 and
of the fiber gene from the viral chromosome is compensated for by the
sequences integrated into the packaging cell genome, and infectious virus
particles are produced. The plaques thus generated are isolated and stocks of
the recombinant virus are produced by standard methods.

Because of the fiber deletion, a pDV44-derived virus is
replication-defective, and cells in which it is grown must complement this
defect. The 211B cell line (a derivative of 293 cells which expresses the
wild-type (wt) Ad5 fiber and is equivalent to 211A on deposit with ATCC as
described in Example 4) was used for rescue and propagation of the virus
described here. pDV44 and pΔE1Bβgal were cotransfected into 211B cells, and
the monolayers were observed for evidence of cytopathic effect (CPE). Briefly,
for virus construction, cells were transfected with the indicated plasmids
using the Gibco Calcium Phosphate Transfection system according to the
manufacturer's instructions and observed daily for evidence of CPE.

One of a total of 58 transfected dishes showed evidence of spreading cell
death at day 15. A crude freeze-thaw lysate was prepared from these cells and
the resulting virus (termed Ad5.βgal.ΔF) was plaque purified twice and then
expanded. To prepare purified viral preparations, cells were infected with the
indicated Ad and observed for completion of CPE. Briefly, at day zero, 211B
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 x 10⁷
cells/150 cm² flask or equivalent density. At day one, the medium was replaced
with one half the original volume of fresh DMEM containing the indicated Ad, in
this case Ad5.βgal.ΔF, at approximately 100 particles/cell. At day two, an equal
volume of medium was added to each flask and the cells were observed for CPE.
Two to five days after infection, cells were collected and virus isolated by
lysis via four rapid freeze-thaw cycles. Virus was then purified by
centrifugation on preformed 15-40% CsCl gradients (111,000 x g for three hours
at 4°C). The bands were harvested, dialyzed into storage buffer (10 mM Tris,
pH 8.1, 0.9% NaCl, and 10% glycerol), aliquoted and stored at -70°C. Purified
Ad5.βgal.ΔF virus particles containing the human adenovirus Ad5.βgal.ΔF genome
(described further below) were deposited with the ATCC on January 15, 1999 as
further described in Example 4.
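
The infection step fixes the viral input per flask through the cell number and
the particle-to-cell ratio. The following sketch shows that arithmetic. The
cell number and ratio are the approximate values given in the text, whereas
the stock concentration is a purely hypothetical value chosen for
illustration, and both function names are assumptions.

```python
def particles_per_flask(cells, particles_per_cell):
    """Total viral particles needed to infect one flask at the given ratio."""
    return cells * particles_per_cell


def inoculum_volume_ul(total_particles, stock_particles_per_ml):
    """Volume of virus stock, in microliters, that supplies total_particles."""
    return 1000 * total_particles / stock_particles_per_ml


CELLS_PER_FLASK = 10_000_000   # approx. 1 x 10^7 cells per 150 cm^2 flask (from the text)
PARTICLES_PER_CELL = 100       # approx. 100 particles/cell (from the text)
HYPOTHETICAL_STOCK = 1e12      # particles/ml; illustrative only, not from the text

if __name__ == "__main__":
    needed = particles_per_flask(CELLS_PER_FLASK, PARTICLES_PER_CELL)
    print(needed, "particles per flask")
    print(inoculum_volume_ul(needed, HYPOTHETICAL_STOCK), "microliters of stock")
```

With the values from the text, each flask receives on the order of 10⁹
particles. The stock volume depends entirely on the measured concentration of
the actual preparation.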

For viral titering, as necessary in the Examples below, Ad preparations
were titered by plaque assay on 211B cells. Cells were plated on
polylysine-coated 6 well plates at 1.5 x 10⁶ cells/well. Duplicate dilutions of
virus stock were added to the plates in 1 ml/well of complete DMEM. After a
five hour incubation at 37°C, virus was removed and the wells overlaid with 2
ml of 0.6% low-melting agarose in Medium 199 (Gibco). An additional 1 ml of
overlay was added at five day intervals.
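
A plaque assay of this kind is conventionally converted to a titer by dividing
the mean plaque count by the product of dilution and inoculum volume. The
sketch below illustrates that conversion and the particle/PFU ratio used later
in this Example. The plaque counts, dilution and particle concentration are
hypothetical numbers, not data from this work, and the function names are
assumptions.

```python
def titer_pfu_per_ml(plaque_counts, fold_dilution, volume_ml=1.0):
    """Titer of the undiluted stock from duplicate wells at one dilution.

    fold_dilution is the dilution expressed as a fold factor
    (e.g. 1_000_000 for a 10^-6 dilution); volume_ml is the inoculum per well.
    """
    mean_count = sum(plaque_counts) / len(plaque_counts)
    return mean_count * fold_dilution / volume_ml


def particle_to_pfu_ratio(particles_per_ml, pfu_per_ml):
    """Physical particles per infectious unit; higher means lower infectivity."""
    return particles_per_ml / pfu_per_ml


if __name__ == "__main__":
    # Hypothetical duplicate wells at a 10^-6 dilution, 1 ml/well as in the text.
    titer = titer_pfu_per_ml([42, 38], fold_dilution=1_000_000, volume_ml=1.0)
    print(titer, "PFU/ml")
    # Hypothetical particle concentration of the same stock.
    print(particle_to_pfu_ratio(1e12, titer), "particles/PFU")
```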

As a control, the first-generation virus Ad5.βgal.wt, which is identical to
Ad5.βgal.ΔF except for the fiber deletion, was constructed by cotransfection of
pBHG10 and pΔE1Bβgal. In contrast to the low efficiency of recovery of the
fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pΔE1Bβgal
and pBHG10 produced virus.
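
The difference in recovery efficiency can be put in numbers. The sketch below
computes the two recovery rates and, as an illustration that is not part of
the original analysis, the hypergeometric probability of the observed split of
virus-positive dishes if recovery were unrelated to the plasmid used. The
function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def recovery_rate(positive, total):
    """Fraction of cotransfected dishes that yielded virus."""
    return Fraction(positive, total)


def table_probability(pos_a, n_a, pos_b, n_b):
    """Hypergeometric probability of this exact 2x2 table with fixed margins."""
    total_dishes = n_a + n_b
    total_positive = pos_a + pos_b
    ways = comb(total_positive, pos_b) * comb(total_dishes - total_positive, n_b - pos_b)
    return Fraction(ways, comb(total_dishes, n_b))


if __name__ == "__main__":
    fiberless = recovery_rate(1, 58)   # pDV44 cotransfection (from the text)
    control = recovery_rate(9, 9)      # pBHG10 cotransfection (from the text)
    print(float(fiberless), float(control))
    print(float(table_probability(1, 58, 9, 9)))
```

Because no table can be more extreme in the direction of the control than 9 of
9 positive dishes, this single probability also serves as a one-sided exact
test under the stated assumption.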

In another embodiment, a delivery plasmid is prepared that does not
require the above-described recombination events to prepare a viral vector
having a fiber gene deletion. In one embodiment, a single delivery plasmid
containing all of the adenoviral genome necessary for packaging but lacking the
fiber gene is prepared from plasmid pFG140, which contains full-length Ad5 and
is commercially available from Microbix. The resultant delivery plasmid,
referred to as pFG140-f, is then used with pCLF stably integrated cells as
described above to prepare a viral vector lacking fiber. For genetic therapy,
the fiber gene can be replaced with a therapeutic gene of interest for
preparing a therapeutic delivery adenoviral vector. Methods for producing a
fiberless vector with a complete TPL are described in Example 3.

Vectors for the delivery of any desired gene, and preferably a therapeutic
gene, are prepared by cloning the gene of interest into the multiple cloning
sites in the polylinker of commercially available pΔE1sp1B (Microbix
Biosystems), in a manner analogous to that used for preparing pΔE1Bβgal as
described above. The same cotransfection and recombination procedure is then
followed as described herein to obtain viral gene delivery vectors as further
discussed in later Examples.

1. Characterization of the Ad5.βgal.ΔF Genome

To confirm that the vector genomes had the proper structures and that
the fiber gene was absent from the Ad5.βgal.ΔF chromosome, the DNA isolated
from viral particles was analyzed. Briefly, purified viral DNA was obtained by
adding 10 µl of 10 mg/ml proteinase K, 40 µl of 0.5 M EDTA and 50 µl of 10%
SDS to 800 µl of adenovirus-containing culture supernatant. The suspension
was then incubated at 55°C for 60 minutes. The solution was then extracted
once with 400 µl of a 24:1 mixture of chloroform:isoamyl alcohol. The aqueous
phase was then removed and precipitated with sodium acetate/ethanol. The pellet
was washed once with 70% ethanol and lightly dried. The pellet was then
suspended in 40 µl of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. Genomic DNA
from Ad5.βgal.wt and Ad5.βgal.ΔF produced the expected restriction patterns
following digestion with either EcoRI or with NdeI. Southern blotting,
performed with standard methods, with labeled fiber DNA as a probe demonstrated
the presence of fiber sequence in Ad5.βgal.wt but not in Ad5.βgal.ΔF DNA. As a
positive control, the blot was stripped and reprobed with labeled E4 sequence.
Fiber and E4 sequences were detected by using labeled inserts from pCLF and
pE4/Hygro, respectively. E4 signal was readily detectable in both genomes at
equal intensities. The complete nucleotide sequence of Ad5.βgal.ΔF is presented
in SEQ ID NO: 23 and is contained in the virus particle on deposit with ATCC.
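
The final reagent concentrations in the viral DNA extraction mix follow from
the stock concentrations and volumes given for the lysis step. The sketch
below works them out. It assumes the volumes are simply additive, and the
names used are hypothetical.

```python
def final_concentration(stock_conc, stock_volume_ul, total_volume_ul):
    """Concentration after dilution, in the same units as stock_conc."""
    return stock_conc * stock_volume_ul / total_volume_ul


# (stock concentration, volume added in microliters), values from the text.
REAGENTS = {
    "proteinase K (mg/ml)": (10.0, 10.0),
    "EDTA (M)": (0.5, 40.0),
    "SDS (%)": (10.0, 50.0),
}
SUPERNATANT_UL = 800.0

# Assumption: volumes are additive on mixing.
TOTAL_UL = SUPERNATANT_UL + sum(volume for _, volume in REAGENTS.values())

if __name__ == "__main__":
    print("total volume (microliters):", TOTAL_UL)
    for name, (conc, volume) in REAGENTS.items():
        print(name, round(final_concentration(conc, volume, TOTAL_UL), 4))
```

Under this assumption the digestion takes place in roughly 0.11 mg/ml
proteinase K, 22 mM EDTA and 0.56% SDS.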

2. Characterization of the Fiberless Adenovirus Ad5.βgal.ΔF

To verify that Ad5.βgal.ΔF was fiber-defective, 293 cells (which are
permissive for growth of E1-deleted Ad vectors but do not express fiber) were
infected with Ad5.βgal.ΔF or with Ad5.βgal.wt. Twenty-four hours post
infection, the cells were stained with polyclonal antibodies directed either
against fiber or against the penton base protein. Cells infected with either
virus were stained by the anti-penton base antibody, while only cells infected
with the Ad5.βgal.wt control virus reacted with the anti-fiber antibody. This
confirms that the fiber-deleted Ad mutant does not direct the synthesis of
fiber protein.

3. Growth of the Fiber-Deleted Ad5.βgal.ΔF Vector in Complementing Cells

Ad5.βgal.ΔF was found to be readily propagated in 211B cells. As
assayed by protein concentration, CsCl-purified stocks of either Ad5.βgal.ΔF or
Ad5.βgal.wt contained similar numbers of viral particles. The particles appeared
to band normally on CsCl gradients. Infectivity of the Ad5.βgal.ΔF particles was
lower than that of the Ad5.βgal.wt control, as indicated by an increased
particle/PFU ratio. Ad5.βgal.ΔF was also found to plaque more slowly than the
control virus. When plated on 211B cells, Ad5.βgal.wt plaques appeared within
5-7 days, while plaques of Ad5.βgal.ΔF continued to appear until as much as
15-18 days post infection. Despite their slower formation, the morphology of
Ad5.βgal.ΔF plaques was essentially normal.

4. Production of Fiberless Ad5.βgal.ΔF Particles

As Ad5.βgal.ΔF represents a true fiber null mutation and its stocks are
free of helper virus, the fiber mutant phenotype was readily investigated. A
single round of growth in cells (such as 293) which do not produce fiber,
generating a homogeneous preparation of fiberless Ad, allowed for the
determination of whether such particles would be stable and/or infectious.
Either Ad5.βgal.wt or Ad5.βgal.ΔF was grown in 293 or 211B cells, and the
resulting particles purified on CsCl gradients as previously described.
Ad5.βgal.ΔF particles were readily produced in 293 cells at approximately the
same level as the control virus and behaved similarly on the gradients,
indicating that there was not a gross defect in morphogenesis of fiberless
capsids.

Particles of either virus contained similar amounts of penton base
regardless of the cell type in which they were grown. This demonstrated that
fiber is not required for assembly of the penton base complex into virions. The
Ad5.βgal.ΔF particles produced in 293 cells did not contain fiber protein.
211B-grown Ad5.βgal.ΔF also contained less fiber than the Ad5.βgal.wt control
virus. The infectivities of the different viral preparations on epithelial cells
correlated with the amount of fiber protein present. The fiberless Ad particles
were several thousand-fold less infectious than the first-generation vector
control on a per-particle basis, while infectivity of 211B-grown Ad5.βgal.ΔF
was only 50-100 fold less than that of Ad5.βgal.wt. These studies confirmed
fiber's crucial role in infection of epithelial cells via CAR binding.

5. Composition and Structure of the Fiberless Ad5.βgal.ΔF Particles

The proteins contained in particles of 293-grown Ad5.βgal.ΔF were
compared to those in Ad5.βgal.wt, to determine whether proteolysis or particle
assembly was defective in this fiber null mutant. The overall pattern of
proteins in the fiberless particles was observed to be quite similar to that of
a first-generation vector, with the exception of reduced intensity of the
composite band resulting from proteins IIIa and IV (fiber). The fiberless
particles also had a reduced level of protein VII. Although substantial amounts
of uncleaved precursors to proteins VI, VII, and VIII were not seen, it is
possible that the low-molecular weight bands migrating ahead of protein VII
represent either aberrantly cleaved viral proteins or their breakdown products.

Cryo-electron microscopy was used to more closely examine the structure
of the 293-grown Ad5.βgal.ΔF and of Ad5.βgal.wt. The fiber, having an
extended stalk with a knob at the end, was faintly visible in favorable
orientations of wild-type Ad5 particles, but not in images of the fiberless
particles. Filamentous material likely corresponding to free viral DNA was seen
in micrographs of fiberless particles. This material was also present in
micrographs of the first-generation control virus, albeit at much lower levels.

Three-dimensional image reconstructions of fiberless and wild-type
particles at ~20 Å resolution showed similar sizes and overall features, with
the exception that fiberless particles lacked density corresponding to the fiber
protein. The densities corresponding to other capsid proteins, including penton
base and proteins IIIa, VI, and IX, were comparable in the two structures. This
confirms that absence of fiber does not prevent assembly of these components
into virions. The fiber was truncated in the wild-type structure as only the
lower portion of its flexible shaft follows icosahedral symmetry. The RGD
protrusions on the fiberless penton base were angled slightly inward relative
to those of the wild-type structure. Another difference between the two penton
base proteins was that there is a ~30 Å diameter depression in the fiberless
penton base around the five-fold axis where the fiber would normally sit. The
Ad5 reconstructions confirm that capsid assembly, including addition of penton
base to the vertices, is able to proceed in the complete absence of fiber.

6. Integrin-Dependent Infectivity of Fiberless Ad5.βgal.ΔF Particles

While attachment via the viral fiber protein is a critical step in the
infection of epithelial cells, an alternative pathway for infection of certain
hematopoietic cells has been described. In this case, penton base mediates
binding to the cells (via β2 integrins) and internalization (through interaction
with αv integrins). Particles lacking fiber might therefore be expected to be
competent for infection of these cells, even though on a per-particle basis they
are several thousand-fold less infectious than normal Ad vectors on epithelial
cells.

To investigate this, THP-1 monocytic cells were infected with
Ad5.βgal.wt or with Ad5.βgal.ΔF grown in the absence of fiber. Infection of
THP-1 cells was assayed by infecting 2 x 10⁵ cells at the indicated m.o.i. in
0.5 ml of complete RPMI. Forty-eight hours post-infection, the cells were fixed
with glutaraldehyde and stained with X-gal, and the percentage of stained cells
was determined by light microscopy. The results of the infection assay showed
that the fiberless particles were only a few-fold less infectious than
first-generation Ad on THP-1 cells. By contrast, large differences were seen in
plaquing efficiency on epithelial (211B) cells. Infection of THP-1 cells by
either Ad5.βgal.ΔF or Ad5.βgal.wt was not blocked by an excess of soluble
recombinant fiber protein, but could be inhibited by the addition of
recombinant penton base. These results indicate that the fiberless Ad particles
use a fiber-independent pathway to infect these cells. Furthermore, the lack of
fiber protein did not prevent Ad5.βgal.ΔF from internalizing into the cells and
delivering its genome to the nucleus, demonstrating that fiberless particles
are properly assembled and are capable of uncoating.

The foregoing results with the recombinant viruses thus produced
indicate that they can be used as gene delivery tools in cultured cells and in
vivo as described more fully in the Examples. For example, for studies of the
effectiveness and relative immunogenicity of multiply-deleted vectors, virus
particles are produced by growth in the packaging lines described in Example 1
and are purified by CsCl gradient centrifugation. Following titering, virus
particles are administered to mice via systemic or local injection or by aerosol
delivery to lung. The LacZ reporter gene allows the number and type of cells
which are successfully transduced to be evaluated. The duration of transgene
expression is evaluated in order to determine the long-term effectiveness of
treatment with multiply-deleted recombinant adenoviruses relative to the

M18369 and M12411. Oligonucleotide primers are designed to amplify the
entire coding sequence of the full-length fiber genes, starting from the start
codon, ATG, and ending with the termination codon TAA. For cloning purposes,
the 5' and 3' primers contain the respective restriction sites BamHI and NotI
for cloning into pcDNA plasmid as described in Example 1A. PCR is performed as
described above.

The resulting products are then used to construct chimeric fiber
constructs by PCR gene overlap extension (Horton et al. (1990) BioTechniques,
8:525-535). The Ad5 fiber tail and shaft regions (5TS; the nucleotide region
encoding amino acid residue positions 1 to 403) are connected to the Ad3 fiber
head region (3H; the nucleotide region encoding amino acid residue positions
136 to 319) to form the 5TS3H fiber chimera. Conversely, the Ad3 fiber tail and
shaft regions (3TS; the nucleotide region encoding amino acid residue positions
1 to 135) are connected to the Ad5 fiber head region (5H; the nucleotide region
encoding amino acid residue positions 404 to 581) to form the 3TS5H fiber
chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 19)
sequence at the fiber shaft-head junction.
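
The residue ranges given for the tail/shaft and head regions determine the
expected sizes of the two chimeras. The sketch below adds them up. It assumes
each range is inclusive and that the regions are joined end to end with no
residues duplicated or lost at the TLWT junction, and the helper names are
hypothetical.

```python
# Residue ranges from the text: region -> (serotype, first residue, last residue).
FIBER_REGIONS = {
    "5TS": ("Ad5", 1, 403),
    "5H": ("Ad5", 404, 581),
    "3TS": ("Ad3", 1, 135),
    "3H": ("Ad3", 136, 319),
}


def region_length(name):
    """Number of amino acid residues in a named region (inclusive range)."""
    _, first, last = FIBER_REGIONS[name]
    return last - first + 1


def chimera_length(tail_shaft, head):
    """Residues in a chimera, assuming a simple end-to-end join."""
    return region_length(tail_shaft) + region_length(head)


if __name__ == "__main__":
    for ts, head in (("5TS", "3H"), ("3TS", "5H")):
        print(ts + head, chimera_length(ts, head), "residues")
```

On these assumptions the 5TS3H chimera is slightly longer than the 581-residue
Ad5 fiber, while 3TS5H is slightly shorter than the 319-residue Ad3 fiber.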

The resultant chimeric fiber PCR products are then digested with BamHI
and NotI for separate directional ligation into a similarly digested pcDNA 3.1.
The TPL sequence is then subcloned into the BamHI site as described in Example
1A for preparing an expression vector for subsequent transfection into 211
cells as described above or into the alternative packaging cell systems as
previously described. The resultant chimeric fiber construct-containing
adenoviral packaging cell lines are then used to complement adenoviral delivery
vectors as previously described. Other fiber chimeric constructs are obtained
with the various adenovirus serotypes using a similar approach.

In an alternative embodiment, the use of modified proteins, including those
with modified epitopes, is contemplated (see, e.g., Michael et al. (1995) Gene
Therapy, 2:660-668 and International PCT application Publication No.
WO 95/26412, which describe the construction of a cell-type specific
therapeutic viral vector having a new binding specificity incorporated into the
virus concurrent with the destruction of the endogenous viral binding
specificity). In particular, the authors described the production of an
adenoviral vector encoding a gastrin releasing peptide (GRP) at the 3' end of
the coding sequence of the Ad5 fiber gene. The resulting fiber-GRP fusion
protein was expressed and shown to assemble functional fiber trimers that were
correctly transported to the nucleus of HeLa cells following synthesis.

Similar constructs are contemplated for use in the complementing
adenoviral packaging cell systems for generating new adenoviral gene delivery
vectors that are targetable, replication-deficient and less immunogenic.
Heterologous ligands contemplated for use herein to redirect fiber specificity
range from as few as 10 amino acids in size to large globular structures, some
of which necessitate the addition of a spacer region so as to reduce or preclude
steric hindrance between the heterologous ligand and the fiber, or interference
with trimerization of the fiber protein. The ligands are inserted at the end or
within the linker region. Preferred ligands include those that target specific
cell receptors or those that are used for coupling to other moieties such as
biotin and avidin.

A preferred spacer includes a short 12 amino acid peptide linker
composed of a series of serines and alanines flanked by a proline residue at
each end, prepared using routine procedures known to those of skill in the art.
The skilled artisan will be familiar with the preparation of linkers to
accomplish sufficient protein presentation and to alter the binding specificity
of the fiber protein without compromising the cellular events that follow viral
internalization. Moreover, within the context of this disclosure, preparation
of modified fibers having ligands positioned internally within the fiber
protein and at the carboxy terminus as described below is contemplated for use
with the methods described herein.

A fiber having a heterologous binding ligand is prepared
essentially as described in the above-cited paper. Briefly, for the ligand of
choice, site-directed mutagenesis is used to insert the coding sequence for a
linker into the 3' end of the Ad5 fiber construct in pCLF as prepared in
Example 1.

The 3' or antisense or mutagenic oligonucleotide encodes a preferred
linker sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20)
followed by a unique restriction site and two stop codons, respectively, to
allow the insertion of a coding sequence for a selected heterologous ligand and
to ensure proper translation termination. Flanking this linker sequence, the
mutagenic oligonucleotide contains sequences that overlap with the vector
sequence and allow its incorporation into the construct. Following mutagenesis
of the pCLF sequence adding the linker and stop codon sequences, a nucleotide
sequence encoding a preselected ligand is obtained, linkers corresponding to the
unique restriction site in the modified construct are attached and then the
sequence is cloned into the linearized corresponding restriction site. The
resultant fiber-ligand construct is then used to transfect 211 or the
alternative cell packaging systems previously described to produce
complementing viral vector packaging systems.
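
The composition of the linker of SEQ ID NO: 20 can be confirmed by converting
the three-letter residue string into one-letter code. The sketch below does
this for the sequence as transcribed in the text. The function name and the
reduced lookup table, which covers only the four residues that occur in this
linker, are assumptions made for illustration.

```python
from collections import Counter

# Standard three-letter to one-letter codes, limited to the residues used here.
THREE_TO_ONE = {"Pro": "P", "Ser": "S", "Ala": "A", "Gly": "G"}

LINKER_SEQ_ID_20 = "ProSerAlaSerAlaSerAlaSerAlaProGlySer"  # as given in the text


def parse_three_letter(seq):
    """Convert a run of three-letter residue codes into a one-letter string."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(THREE_TO_ONE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    linker = parse_three_letter(LINKER_SEQ_ID_20)
    print(linker, len(linker), dict(Counter(linker)))
```

The result is a 12-residue linker made up largely of alternating serine and
alanine, which matches the description of the preferred spacer.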

In a further embodiment, intact fiber genes from different Ad serotypes
are expressed by 211 cells or an alternative packaging system as previously
described. A gene encoding the fiber protein of interest is first cloned to
create a plasmid analogous to pCLF, and stable cell lines producing the fiber
protein are generated as described above for Ad5 fiber. The adenovirus vector
described which lacks the fiber gene is then propagated in the cell line
producing the fiber protein relevant for the purpose at hand. As the only fiber
gene present is the one in the packaging cells, the adenoviruses produced
contain only the fiber protein of interest and therefore have the binding
specificity conferred by the complementing protein. Such viral particles are
used in studies such as those described above to determine their properties in
experimental animal systems.

EXAMPLE 3 

Tripartite leader sequences (TPLs) that are useful in enhancing the
expression of complementing adenoviral proteins, particularly fiber protein, for
use in preparing an adenoviral gene delivery vector are provided. The complete
Ad5 TPL was constructed by assembling PCR fragments. First, the third TPL
exon (exon 3) (nt 9644-9731 of the Ad5 genome) was amplified from Ad5
genomic DNA using the synthetic oligonucleotide primers
5'CTCAACAATTGTGGATCCGTACTCC3' (SEQ ID No. 24) and
5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID No. 25). The resulting
product was cloned to the BamHI and BglII sites of pΔE1sp1A (Microbix
Biosystems) using sites in the primers (shown in bold) to create plasmid pDV52.

WO 01/83729 PCT/EP01/04863

A fragment corresponding to the first TPL exon (exon 1), the natural first intron
(intron 1), and the second TPL exon (exon 2) (Ad5 nt 6049-7182) was then
amplified using primers 5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID No. 26) and
5'CTACATGCTAGGCAGATCTCGTTCGGAG3' (SEQ ID No. 27), and
cloned into the BamHI site of pDV52 (again using sites in the primers) to create
pDV55.

This plasmid contains a 1.2 kb BamHI/BglII fragment containing the first
TPL exon, the natural first intron, and the fused second and third TPL exons.
The nucleotide sequence of the complete TPL containing the noted 5' and 3'
restriction sites is shown in SEQ ID No. 28 with the following nucleotide regions
identified: 1-6 nt BamHI site; 7-47 nt first leader segment (exon 1); 48-1068 nt
natural first intron (intron 1); 1069-1140 nt second leader segment (exon 2);
1141-1146 nt fused BamHI and BglII sites; 1147-1234 nt third leader segment
(exon 3); and 1235-1240 nt BglII site.
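
The feature map of the TPL cassette can be cross-checked against the Ad5
genomic coordinates quoted for the two amplified fragments. The following
sketch, offered only as a consistency check, verifies that the listed features
tile the 1240 nt cassette without gaps and that the exon and intron lengths
agree with Ad5 nt 6049-7182 and nt 9644-9731. All coordinates come from the
text, and the function names are hypothetical.

```python
# (feature, first nt, last nt) within the cassette of SEQ ID No. 28.
TPL_FEATURES = [
    ("BamHI site", 1, 6),
    ("exon 1", 7, 47),
    ("intron 1", 48, 1068),
    ("exon 2", 1069, 1140),
    ("fused BamHI/BglII sites", 1141, 1146),
    ("exon 3", 1147, 1234),
    ("BglII site", 1235, 1240),
]


def feature_lengths(features):
    """Map each feature name to its length (inclusive coordinates)."""
    return {name: last - first + 1 for name, first, last in features}


def is_contiguous(features):
    """True if every feature starts one nucleotide after the previous one ends."""
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(features, features[1:]))


if __name__ == "__main__":
    lengths = feature_lengths(TPL_FEATURES)
    print(lengths, sum(lengths.values()), is_contiguous(TPL_FEATURES))
    # Genomic fragment sizes from the Ad5 coordinates given in the text.
    print(7182 - 6049 + 1, 9731 - 9644 + 1)
```

The exon 1, intron 1 and exon 2 lengths sum to the size of the nt 6049-7182
fragment, and exon 3 matches the size of the nt 9644-9731 fragment, so the
annotation is internally consistent.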

EXAMPLE 4 

Deposit of Materials 

The following cell lines and plasmids were deposited on September 25,
1996, with the American Type Culture Collection, 10801 University Blvd,
Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on
the International Recognition of the Deposit of Microorganisms for the Purpose
of Patent Procedure and the Regulations thereunder (Budapest Treaty): Plasmid
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737),
211 Cell Line (accession number CRL-12193) and 211A Cell Line (accession
number CRL-12194).

The following virus, Ad5.βgal.ΔF, was deposited on January 15, 1999,
with the ATCC as listed above and provided with accession number VR2636.

Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were
deposited at the ATCC on January 5, 2000 and provided with accession
numbers PTA-1144, PTA-1145, PTA-1146, PTA-1147 and PTA-1148,
respectively.

EXAMPLE 5

Preparation and Use of Adenoviral Packaging Cell Lines
Containing Plasmids Containing Alternative TPLs

Plasmids containing tripartite leaders (TPLs) have been constructed. The
resulting plasmids, which contain different selectable markers, such as neomycin
and zeocin, were then used to prepare fiber-complementing stable cell lines for
use in preparing adenoviral vectors.

A. pDV60 

Plasmid pDV60 was constructed by inserting the TPL cassette of SEQ ID
No. 28 into the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a
neomycin selectable plasmid (see, e.g., U.S. application Serial No. 09/482,682
(also filed as International PCT application No. PCT/US00/00265 on January 14,
2000); see also Von Seggern et al. (1998) J. Gen. Virol., 79:1461-1468). The
nucleotide sequence of pDV60 is listed in SEQ ID NO: 29. Plasmid pDV60 has
been deposited in the ATCC under accession number PTA-1144.

B. pDV61 

To construct pDV61, an Asp718/NotI fragment containing the CMV
promoter, partial Ad5 TPL, wild-type Ad5 fiber gene, and bovine growth hormone
terminator was transferred from pCLF (ATCC accession number 97737; described
in copending U.S. application Serial No. 09/482,682 (also filed as
International PCT application No. PCT/US00/00265 on January 14, 2000)) to a
zeocin selectable cloning vector referred to as pcDNA3.1/Zeo(+) (commercially
available from Invitrogen and for which the sequence is known).

C. pDV67

In an analogous process, pDV67 containing the complete TPL was
constructed by transferring an Asp718/XbaI fragment from pDV60 into the
pcDNA3.1/Zeo(+) backbone. The nucleotide sequence of pDV67 is set forth in
SEQ ID No. 30. Plasmid pDV67 is available from the ATCC under accession
number PTA-1145.

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric
Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson et al. (1995) J.
Virol., 69:2850-2857) using the primers
5'ATGGGATCAAGATGAAGCGCGCAAGACCG3' (SEQ ID No. 31) and
5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID No. 32), and cloned to
the BamHI and NotI sites of pcDNA3.1/Zeo(+) via new BamHI and NotI sites
engineered into the primers to create pDV68. Finally, the complete TPL
fragment described above was added to the unique BamHI site of pDV68
to create pDV69. The nucleotide sequence of pDV69 is listed in SEQ ID No. 33,
and the plasmid has been deposited in the ATCC under accession number PTA-1146.

E. Preparation of Stable Adenovirus Packaging Cell Lines 

E1-2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC #
CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 34) and
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in
SEQ ID No. 35), which provide complementation of the adenoviral E1 and E2a
functions, respectively. This line and its derivatives were grown in Richter's
modified medium (BioWhittaker) + 10% FCS. E1-2a S8 cells were
electroporated as previously described (Von Seggern et al. (1998) J. Gen Virol.,
79: 1461-1468) with pDV61, pDV67, or pDV69, and stable lines were
selected with zeocin (600 µg/ml).

The cell line generated with pDV61 is designated 601. The cell line
generated with pDV67 is designated 633, while that generated with pDV69 is
designated 644. Candidate clones were evaluated by immunofluorescent
staining with a polyclonal antibody raised against the Ad2 fiber. Lines
expressing the highest level of fiber protein were further characterized.

For the S8 cell complementing cell lines, to induce E1 expression, 0.3 µM
dexamethasone was added to cell cultures 16-24 hours prior to challenge
with virus for optimal growth kinetics. For preparing viral plaques, 5 x 10^5
cells/well in 6-well plates are prepared and pre-induced with the same
concentration of dexamethasone the day prior to infection, with 0.5 µM included
as the final concentration in the agar overlay after infection.

F. Development of Cell Lines for Complementation of E1-/E2a- Vectors

The Adenovirus 5 genome was digested with ScaI enzyme, separated on
an agarose gel, and the 6,095 bp fragment containing the left end of the virus
genome was isolated. The complete Adenovirus 5 genome is registered as
Genbank accession #M73260, incorporated herein by reference, and the virus is
available from the American Type Culture Collection, Manassas, Virginia, U.S.A.,
under accession number VR-5. The ScaI 6,095 bp fragment was digested
further with ClaI at bp 917 and BglII at bp 3,328. The resulting 2,411 bp ClaI to
BglII fragment was purified from an agarose gel and ligated into the superlinker
shuttle plasmid pSE280 (Invitrogen, San Diego, CA), which was digested with
ClaI and BglII, to form pSE280-E.

Polymerase chain reaction (PCR) was performed to synthesize DNA
encoding an XhoI and SalI restriction site contiguous with Adenovirus 5 DNA bp
552 through 924. The primers which were employed were as follows:

5' end, Ad5 bp 552-585:

5'-GTCACTCGAGGACTCGGTCGACTGAAAATGAGACATATTATCTGCCACGGACC-3'
(SEQ ID No. 36)

3' end, Ad5 bp 922-891:

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID No. 37)

This amplified DNA fragment (sometimes hereinafter referred to as
Fragment A) then was digested with XhoI and ClaI, which cleaves at the native
ClaI site (bp 917), and ligated to the XhoI and ClaI sites of pSE280-E, thus
reconstituting the 5' end of the E1 region beginning 8 bp upstream of the ATG
codon.

PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323
through 4,090 contiguous with an EcoRI restriction site. The primers which
were employed were as follows:

5' end, Ad5 bp 3323-3360:

5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID No. 38);

and

3' end, Ad5 bp 4090-4060:

5'-GCGACTTAAGCAGTCAGCTGAGACAGCAAGACACTTGCTTGATCCAAATCC-3'
(SEQ ID No. 39).

This amplified DNA fragment (sometimes hereinafter referred to as
Fragment B) was digested with BglII, thereby cutting at the Adenovirus 5 BglII
site (bp 3,328), and EcoRI, and ligated to the BglII and EcoRI sites of pSE280-AE
to reconstruct the complete E1a and E1b region from Adenovirus 5 bp 552
through 4,090. The resulting plasmid is designated pSE280-E1.

A construct containing the intact E1a/b region under the control of the
synthetic promoter GRE5 was prepared as follows. The intact E1a/b region was
excised from pSE280-E1, which was modified previously to contain a BamHI site
3' to the E1 gene, by digesting with XhoI and BamHI. The XhoI to BamHI
fragment containing the E1a/b region was cloned into the unique XhoI and
BamHI sites of pGRE5-2/EBV (U.S. Biochemicals, Cleveland, Ohio) to form
pGRE5-E1.

Bacterial transformants containing the final construct were identified.
Plasmid DNA was prepared and purified by banding in CsTFA prior to use for
transfection of cells.

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and SpeI, which cut
at bp 21,562 and 27,080, respectively. Fragments were separated on an
agarose gel and the 5,518 bp BamHI to SpeI fragment was isolated. The 5,518
bp BamHI to SpeI fragment was digested further with SmaI, which cuts at bp
23,912. The resulting 2,350 bp BamHI to SmaI fragment was purified from an
agarose gel and ligated into the superlinker shuttle plasmid pSE280, which had
been digested with BamHI and SmaI, to form pSE280-E2 BamHI-SmaI.
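The fragment sizes quoted in this section follow directly from the cut-site coordinates given in the text. As a quick consistency check, the following minimal Python sketch recomputes them; the helper name `fragment_length` is hypothetical, and each size is taken simply as the difference between the two quoted cut positions.

```python
def fragment_length(cut_a: int, cut_b: int) -> int:
    """Length in bp between two cut positions on a linear genome."""
    return abs(cut_b - cut_a)

# Cut positions (bp) in the Ad5 genome, as quoted in the text.
E1_CLAI, E1_BGLII = 917, 3328
E2A_BAMHI, E2A_SMAI, E2A_SPEI = 21562, 23912, 27080

sizes = {
    "ClaI-BglII": fragment_length(E1_CLAI, E1_BGLII),
    "BamHI-SpeI": fragment_length(E2A_BAMHI, E2A_SPEI),
    "BamHI-SmaI": fragment_length(E2A_BAMHI, E2A_SMAI),
}

for name, bp in sizes.items():
    print(f"{name}: {bp:,} bp")
```

The three values agree with the 2,411 bp, 5,518 bp, and 2,350 bp fragments stated above.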

PCR then was performed to amplify Adenovirus 5 DNA from the SmaI site
at bp 23,912 through 24,730 contiguous with NheI and EcoRI restriction sites.
The primers which were employed were as follows:

5' end, Ad5 bp 24,732-24,708:

5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID No. 40);

3' end, Ad5 bp 23,912-23,934:

5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID No. 41)

This amplified DNA fragment was digested with SmaI and EcoRI, and
ligated to the SmaI and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the
complete E2a region from Ad5 bp 24,730 through 21,562. The resulting
construct is pSE280-E2a.

In order to convert the BamHI site at the 3' end of E2a to a SalI site, the
E2a region was excised from pSE280-E2a by cutting with BamHI and NheI, and
recloned into the unique BamHI and NheI sites of pSE280. Subsequently, the
E2a region was excised from this construction with NheI and SalI in order to
clone into the NheI and SalI sites of the pMAMneo (Clontech, Palo Alto, CA)
multiple cloning site in a 5' to 3' orientation, respectively. The resulting
construct is pMAMneo-E2a.

Bacterial transformants containing the final pMAMneo-E2a were
identified. Plasmid DNA was prepared and purified by banding in CsTFA.
Circular plasmid DNA was linearized at the XmnI site within the ampicillin
resistance gene of pMAMneo-E2a, and further purified by phenol/chloroform
extraction and ethanol precipitation prior to use for transfection of cells.

Transfection and selection of cells.

In general, this process involved the sequential introduction, by calcium
phosphate precipitation or other means of DNA delivery, of two plasmid
constructions, each with a different viral gene, into a single tissue culture cell.
The cells were transfected with a first construct and selected for expression of
the associated drug resistance gene to establish stable integrants. Individual cell
clones were established and assayed for function of the introduced viral gene.
Appropriate candidate clones then were transfected with a second construct
including a second viral gene and a second selectable marker. Transfected cells
then were selected to establish stable integrants of the second construct, and
cell clones were established. Cell clones were assayed for functional expression
of both viral genes.

A549 cells (ATCC Accession No. CCL-185) were used for transfection.
Appropriate selection conditions were established for G418 and hygromycin B by
standard kill curve determination.

Transfection of A549 cells with plasmids including E1 and E2a regions.

pMAMNeo-E2a was linearized with XmnI within the Amp^R gene, introduced
into cells by transfection, and cells were selected for stable integration of this
plasmid by G418 selection until drug resistant colonies arose. The clones were
isolated and screened for E2a expression by staining for E2a protein with a
polyclonal antiserum, and visualizing by immunofluorescence. E2a function was
screened by complementation of the temperature-sensitive mutant Ad5ts125
virus, which contains a temperature-sensitive mutation in the E2a gene (Van Der
Vliet, et al., J. Virology, Vol. 15, pgs. 348-354 (1975)). Positive clones
expressing the E2a gene were identified and used for transfection with the 7 kb
EcoRV to XmnI fragment from pGRE5-E1, which contains the GRE5-promoted
E1a/b region plus the hygromycin^R gene. Cells were selected for hygromycin
resistance and assayed for E1a/b expression by staining with a monoclonal
antibody for the E1 protein (Oncogene Sciences, Uniondale, N.Y.). E1 function
was assayed by ability to complement an E1-deleted vector. At this point,
expression and function of E2a was verified as described above, thus
establishing the expression of E1a/b and E2a in the positive cell clones.

A transfected A549 (ATCC Accession No. CCL-185) cell line showed good
E1a/b and E2a expression and was selected for further characterization. It was
designated the S8 cell line.

G. Preparation of Adenoviral Vectors Containing Ad5.βgal.ΔF Genome in S8
Improved Fiber-Complementing Cell Lines

To prepare adenoviral vectors containing Ad5.βgal.ΔF (Ad5.βgal.ΔF has
been deposited with the ATCC under accession number VR2636) in S8 cells
containing alternative forms of TPL for enhancing the expression of fiber
proteins, the protocol as described in Example 2 for preparing Ad5.βgal.ΔF in
211B cells was followed, with the exception of pretreatment with 0.3 µM
dexamethasone for 24 hours as described above. Thus, viral particles with the
wildtype Ad5 fiber protein on their surface and containing the fiberless
Ad5.βgal.ΔF genome were produced in 633 cells. Particles produced in 644
cells also contained the fiberless Ad5.βgal.ΔF genome, but had the chimeric
5TS3H fiber protein, with the Ad3 fiber knob, on their surface.

Thus, these viral preparations, prepared as described herein, are useful for
targeting delivery of the Ad5.βgal.ΔF, Ad5.GFP.ΔF, or other similarly
constructed fiberless genome with either wild-type or modified fibers. Preferably
for purposes herein the fibers are from an Ad serotype D virus, more preferably
from Ad37.

EXAMPLE 6

Pseudotyping and Infectivity of Recombinant Adenoviral Vectors Produced
with Improved Fiber-Complementing Cell Lines



A. Pseudotyping of Ad5.βgal.ΔF

To verify that the adenoviral vectors produced had altered tropisms,
viral particles were purified from either 633 or 644 cells and were then Western
blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber
(which detects the Ad5 and chimeric 5TS3H fiber proteins).

B. Infectivity of Cells with 633 or 644 Generated Virus Particles

The cell lines, 633 or 644, prepared as described above, were infected with the
indicated number of particles/cell of Ad5.βgal.ΔF and virus particles were
produced. Virus was then used to infect selected cell lines, including 211B,
MRC-5 human fibroblasts, A-10 rat aortic endothelial cells, and THP-1 human
monocytic cells. Unbound virus was removed by washing the cells and the cells
were further incubated at 37°C for 48 hours. Cells were then fixed with
glutaraldehyde and stained with X-gal. The percentage of stained cells was then
determined by light microscopy; all experiments were done in triplicate.

The results indicated that adenoviral vectors could be retargeted by
pseudotyping using packaging cell lines expressing different fiber proteins.
Particles containing either fiber were equally infectious on 211B cells, while
MRC-5 fibroblasts and THP-1 cells were more readily infected by virus containing
the chimeric fiber. The A-10 rat endothelial cells were more readily infected by
particles containing the wildtype Ad5 fiber protein.




WO 01/83729 



PCT/EPO 1/04863 



-81- 



EXAMPLE 7 

Transient Transcomplementation 

The ability of adenovirus type 5 (Ad5) to deliver therapeutic genes to cells
is mediated by the interaction of the adenoviral fiber protein with the
coxsackievirus-adenoviral receptor (CAR). Because a wide range of cells express
CAR, it was thought that it would be difficult to use adenoviruses to deliver
genes to specific cell types. A system for testing modified fiber genes to
identify tropisms of interest is described in copending U.S. application Serial
No. 09/482,682 (also filed as International PCT application No.
PCT/US00/00265 on January 14, 2000). An in vitro system has been
developed that involves infection of tissue culture cells with a fiber-deleted Ad
and transient co-transfection with a plasmid directing fiber expression. This
system allows one to produce and evaluate modified fibers expressed on a viral
particle. This system can be used to produce therapeutic quantities of
adenoviral vectors with modified fiber proteins, with such fibers having a new
tropism added by insertion of a desired ligand into the fiber gene. These fibers
may also have the natural tropism (i.e., binding to CAR) ablated.

Plasmids pDV60 and pDV55 were prepared as described
herein and in U.S. application Serial No. 09/482,682 (also filed as International
PCT application No. PCT/US00/00265 on January 14, 2000). pDV60 is a
pcDNA3.1-based expression plasmid that contains the CMV promoter, Ad5
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains
no fiber gene and serves as the negative control. Ad5.βgal.ΔF and 211B are
described above. 293T cells are identical to 293 cells except they express an
integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts.
293T cells express CAR and αv integrins; HDF cells express αv integrins but no
CAR. Transfections with fiber expression plasmids were performed with
Lipofectamine (GIBCO-BRL) using 20 µg DNA and 50 µl Lipofectamine per 15 cm
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine
serum.

The fiber deletion mutation of Ad5.βgal.ΔF is complemented in trans by
passaging virions through 211B, a cell line that stably expresses functional Ad5
fiber. The present system was designed to complement Ad5.βgal.ΔF by
modified fibers expressed from transfected episomal plasmids in 293T cells. The
result is a simplified and rapid method to incorporate modified fibers on a viral
particle containing the Ad5.βgal.ΔF genome that does not require propagation of
the virus.

The feasibility of transcomplementation of Ad5.βgal.ΔF with episomal
fiber-expressing plasmids was demonstrated in the following experiment. 293T
cells were transfected with one of two plasmids: pDV55, which expresses no
fiber, or pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists
for at least six days. Twenty-four hours after transfection, these cells were
infected at 2000 particles/cell with Ad5.βgal.ΔF passaged through 211B cells.
Seventy-two hours later, a crude viral lysate (CVL) was generated by exposing
the cells to five freeze-thaw cycles. Viral particles were purified by cesium
chloride gradient centrifugation. The resulting virions incorporated the fiber
expressed from the episomal plasmid, as confirmed by Western blots performed
with an antibody specific to the Ad5 fiber.

The episomal plasmid transcomplementation system is suitable for quickly
expressing and evaluating the properties of modified fibers in the context of a
viral particle. Episomal plasmid transcomplementation will also be of great utility
for quickly evaluating a bank of modified fibers for other binding properties,
including new tropisms and the ablation of the native tropism. In addition to the
rapid generation and testing of large numbers of modified fibers, there are other
advantages to the Ad5.βgal.ΔF transcomplementation system in terms of
production and safety. Episomal plasmid transcomplementation has the inherent
advantage over transcomplementation in that it is not necessary to make a
stable cell line for every modified fiber for complementation with Ad5.βgal.ΔF.
Because Ad5.βgal.ΔF is deleted in E1, E3 and fiber, there is an additional
gene deletion, which should render it very suitable for gene therapy. In addition,
the presence of the fiber gene deletion decreases the opportunity to generate
replication-competent virus via recombination in the packaging cells. A single
Ad vector preparation can be retargeted to any number of different cell types
simply by transfecting the cells with the appropriate fiber-expression construct.

EXAMPLE 8

Preparation of Adenoviral Gene Delivery Vectors Containing the Ad37 fiber 
protein 

This example describes construction of packaging cell lines expressing
the Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad
vector (such as Ad5.βgal.ΔF) containing this fiber protein. The fiber protein is
attached to the viral capsid by binding to the penton base protein through its
N-terminus, and the Ad37 fiber was modified in order to make its N-terminal
sequence more closely match that of the Ad5 protein to ensure that it would
efficiently bind the Ad5 penton base in these vectors.

A. Materials and methods 

Cell lines and wild-type adenovirus. Human A549 lung carcinoma
epithelial cells and human Chang C conjunctival cells (American Type Culture
Collection) were maintained in complete Dulbecco's Modified Eagle Medium
(DMEM) with 10% fetal bovine serum. Wild-type Ad19p and Ad37 (ATCC) were
propagated in A549 cells and purified by banding on CsCl density gradients as
previously described (Huang et al. (1999) J. Virol. 73: 2798-2802). Viral protein
concentration was determined by the Bio-Rad Protein Assay, and was used to
calculate the number of viral particles based on the known molecular weight of
Ad2 virions (1 µg = 4 x 10^9 particles).
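The protein-to-particle conversion stated above (1 µg of viral protein corresponds to 4 x 10^9 particles) can be applied as in the following minimal Python sketch. The function names, the stock concentration, and the cell number are hypothetical illustration values, not figures from this document.

```python
PARTICLES_PER_UG = 4e9  # conversion stated in the text for Ad2 virions

def particles_from_protein(ug_protein: float) -> float:
    """Number of viral particles corresponding to a mass of viral protein (µg)."""
    return ug_protein * PARTICLES_PER_UG

def stock_volume_ml(particles_per_cell: float, n_cells: float,
                    stock_ug_per_ml: float) -> float:
    """Volume of virus stock (ml) needed to reach a given particles/cell dose."""
    total_particles = particles_per_cell * n_cells
    return total_particles / (stock_ug_per_ml * PARTICLES_PER_UG)

# 10 µg of purified virus, the amount loaded per lane in the protein analyses.
n_particles = particles_from_protein(10)

# Hypothetical example: 10,000 particles/cell on 1e5 cells from a 100 µg/ml stock.
volume = stock_volume_ml(10_000, 1e5, 100)

print(f"{n_particles:.1e} particles; {volume * 1000:.2f} µl of stock")
```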

B. Construction of the Ad37 fiber expressing cell lines and the recombinant
Ad37 knob protein.

1. Construction of an Expression Plasmid for the Ad37 Fiber Protein
(pDV80)

The plasmid designated pDV80 (see SEQ ID No. 42), prepared for
expression of the Ad37 fiber protein in mammalian cells, uses the same
regulatory elements as pDV60, pDV67, and pDV69 to express
the Ad37 fiber in packaging lines. It was constructed in two steps.

First, the Ad37 fiber open reading frame was amplified from Ad37
genomic DNA using synthetic oligonucleotide primers, L37: 5' TGT CCT GGA
TCC AAG ATG AAG CGC GCC CGC CCC AGC GAA GAT GAC TTC 3' (SEQ ID
NO. 43) and 37FR: 5' AAA CAC GGC GGC CGC TCT TTC ATT CTT G 3' (SEQ
ID NO. 44). L37 contains nucleotides (underlined) that differ from the Ad37
genomic sequence in order to add a unique BamHI site (bold) before the start
codon (italicized) and to create point mutations that make the N-terminal
sequence of the fiber more closely match the N-terminal sequence of the Ad5
fiber protein as follows:

Ad37  MSKRLRVEDDFNPVYPY (SEQ ID No. 45)
       ↓ ↓ ↓ ↓ ↓
       KRARPS (SEQ ID No. 46)
Ad5   MKRARPSEDTFNPVYPY (SEQ ID No. 47).

37FR also incorporates a unique NotI site (bold). The PCR product was inserted
into the BamHI and NotI sites of pcDNA3.1/Zeo(+) (Invitrogen) to create
pDV78. The correct sequence of the Ad37 fiber protein, including inserted
changes, was confirmed by sequencing.
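The effect of the L37 primer on the fiber N-terminus can be checked by translating the primer from its start codon. The sketch below is illustrative only: it assumes the codon immediately after the ATG reads AAG (lysine), as the MKRARPS target sequence requires, and it uses a codon table restricted to the codons that occur in this primer.

```python
# Subset of the standard genetic code covering only the codons in primer L37.
CODONS = {
    "ATG": "M", "AAG": "K", "CGC": "R", "GCC": "A", "CCC": "P",
    "AGC": "S", "GAA": "E", "GAT": "D", "GAC": "D", "TTC": "F",
}

# Primer L37 (SEQ ID NO. 43) with spaces removed; codon 2 assumed to be AAG.
L37 = "TGTCCTGGATCCAAGATGAAGCGCGCCCGCCCCAGCGAAGATGACTTC"
BAMHI_SITE = "GGATCC"
AD5_N_TERMINUS = "MKRARPSEDTFNPVYPY"  # SEQ ID No. 47

# Locate the BamHI site, then the first ATG downstream of it.
site_pos = L37.index(BAMHI_SITE)
start = L37.index("ATG", site_pos + len(BAMHI_SITE))

# Translate in frame from the start codon to the end of the primer.
coding = L37[start:]
peptide = "".join(CODONS[coding[i:i + 3]] for i in range(0, len(coding), 3))

print(peptide)                                   # primer-encoded N-terminus
print(peptide[:7] == AD5_N_TERMINUS[:7])         # matches the Ad5 MKRARPS motif
```

The primer therefore encodes the Ad5-like residues MKRARPS followed by the native Ad37 residues EDDF, consistent with the alignment above.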

Two point mutations in the fiber gene in the 705 line, S356 to P356 and
I362 to T362, were discovered by the sequencing. The mutations are not in the
receptor binding domain of the Ad37 fiber gene in the 705 cell line; they are
buried in the knob trimer interface. To confirm that these mutations do not
affect receptor binding, the Ad37 fiber protein with the correct sequence was
recloned, and 293T cells were transfected with the resulting construct and
subsequently infected with Ad5.GFP.ΔF to produce Ad37 pseudotyped virus. The
results were the same as the results of the experiments with Ad37 pseudotyped
virus produced from line 705 (see Wu et al. (2001) Virology 279: 78-89).

Second, a 1.2 kb BamHI/BglII fragment containing an adenovirus type 5
tripartite leader was excised from pDV55 (see EXAMPLE 3) and inserted into the
BamHI site of pDV78 to create pDV80 (SEQ ID No. 42). Plasmid pDV80 has
been deposited in the ATCC under accession number PTA-1147.

2. Construction of the recombinant Ad37 knob protein 

Recombinant Ad37 knob protein containing an N-terminal T7•Tag was
produced in E. coli using the pET expression system (Novagen). Ad37 fiber DNA
(GenBank accession number U69132) was PCR amplified from wild-type Ad37
genomic DNA using the following primers (SEQ ID Nos. 48 and 49):

5' GGATCCATGGGATACTTGGTAGCA 3' (BamHI site underlined) and
5' GCAACTCGAGTCATTCTTGGGCAATATAGG 3' (XhoI site underlined).

The PCR reactions were performed at 94°C (denaturation), 55°C
(annealing), 72°C (extension, 30 cycles) using Taq DNA polymerase (Qiagen).
The amplified DNA fragments, which contained residues 172 to 365 of the Ad37
fiber protein with the addition of an N-terminal start codon (italicized), were
purified and subcloned into the pCR-TOPO vector using the TA-Cloning Kit
(Invitrogen). No replication errors were found by DNA sequencing. Plasmids
from cultured transformed colonies were purified and digested with BamHI and
XhoI. The fragment was inserted into the BamHI and XhoI sites of the bacterial
expression vector pET21a (Novagen), and transformed into (DE3)pLysS
expression cells (Invitrogen). Colonies were selected for knob expression by
induction with 1 mM IPTG for four hours at 37°C, and knob expression was
determined by SDS-PAGE. The colony displaying highest knob expression was
used for large-scale knob expression and induced with 0.5 mM IPTG at 30°C for
four hours.

The recombinant T7•Tagged Ad37 knob protein was purified from
sonicated bacteria using the T7•Tag Affinity Purification Kit as recommended by
the manufacturer (Novagen). Recovered protein was analyzed for purity by
SDS-PAGE followed by Coomassie staining or Western blotting with an
HRP-conjugated α-T7•Tag monoclonal antibody as described by the manufacturer
(Novagen) or an α-Ad37 fiber rabbit antibody.

3. Preparation of Cell Lines that Express the Ad37 fiber protein

Plasmid pDV80 DNA was purified using the Qiagen method and
electroporated into the adenovirus-complementing cell line E1-2a S8 (see
Examples herein; see also Gorziglia et al. (1996) J. Virology 70:4173-4178;
and Von Seggern et al. (1998) J. Gen. Virol. 79:1461-1468). Stable clones were
selected with 600 µg/ml zeocin (Invitrogen).

Clones were expanded and were screened for fiber expression by indirect
immunofluorescence (Von Seggern et al. (1998) J. Gen. Virol. 79:1461-1468)
using a rabbit polyclonal antibody directed against the Ad37 fiber (α-Ad37 fiber
rabbit antibody) raised by immunizing rabbits with recombinant Ad37 fiber
protein. Two clones (lines 705 and 731) that expressed the protein at a
uniformly high level were selected.

EXAMPLE 9

Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37
fiber protein, the Ad37 fiber-expressing 705 cells were infected (approximately
1000 particles/cell) with Ad5.βgal.ΔF or with Ad5.GFP.ΔF.

Materials and methods 

Ad5.βgal.ΔF

The construction of Ad5.βgal.ΔF is described in Example 2 (it was
deposited on January 15, 1999, with the ATCC as listed above under accession
number VR2636; see also Von Seggern et al. (1999) J. Virol. 73:1601-1608,
copending U.S. application Serial No. 09/482,682 filed January 14, 2000, and
also International PCT application No. PCT/US00/00265, filed January 14,
2000).

Ad5.GFP.ΔF

Ad5.GFP.ΔF was constructed by recombination in bacteria using a
modification of the AdEasy System (see U.S. Patent No. 5,922,576; see also
He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 2509-2514; the system is
publicly available from the authors and other sources).

First, a fiber-deleted genomic plasmid was constructed by removing the
fiber gene from pAdEasy-1 (see U.S. Patent No. 5,922,576; and He et al.
(1998) Proc. Natl. Acad. Sci. U.S.A. 95: 2509-2514; the AdEasy system and
vectors are publicly available from He et al. at Johns Hopkins University).
Plasmid pAdEasy-1 contains the entire Ad5 genome, except for nucleotides
1-3,533, which encompass the E1 genes, and nucleotides 28,130-30,820, which
encompass the E3 gene.

Plasmid pDV43 (see Example 2; see also Von Seggern et al. (1999)
J. Virol. 73:1601-1608) was digested with PacI, the ends blunted by treatment
with the large fragment of E. coli DNA polymerase and dNTPs, and the product
re-ligated to produce plasmid pDV76. The resulting plasmid pDV76 is identical to
pDV43 except for loss of the PacI site and contains the right end of the Ad5
genome with E3 and fiber deletions. A 4.23 kb fragment from pDV76 was
amplified using the oligonucleotide primers (SEQ ID Nos. 50 and 51):
5' CGC GCT GAC TCT TAA GGA CTA GTT TC 3', including the unique SpeI
site in the Ad5 genome (bold); and 5' GCG CTT AAT TAA CAT CAT CAA TAA
TAT ACC TTA TTT T 3', including a new PacI site (bold) adjacent to the right
Ad5 ITR. Hence the resulting PCR amplified fragment contains nucleotides
27,082 to 35,935 of the Ad5 genome with deletions of nucleotides 28,133 to
32,743 (the E3 and fiber genes), and was used to replace the corresponding
SpeI/PacI fragment of pAdEasy-1 (see U.S. Patent No. 5,922,576) to create
pDV77.

Second, E. coli strain BJ5183 was electroporated with a mixture of
pDV77 and PmeI-linearized pAdTrack as described (U.S. Patent No. 5,922,576;
He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 2509-2514), and DNA was
isolated from kanamycin-resistant colonies. The resulting plasmid, pDV83,
contains a complete Ad5 genome with E1-, E3-, and fiber-deletions with a
CMV-driven GFP reporter gene inserted at the site of the E1 deletion. The full
length Ad chromosome was isolated by PacI digestion, and transfected into the
E1- and fiber-complementing 633 cells (Von Seggern et al. (2000) J. Virol.
74: 354-362). The 633 cells were produced by electroporating pDV67 (SEQ ID
No. 30, deposited under ATCC accession number PTA-1145) into the E1-2a S8
cells, described above. The recovered virus Ad5.GFP.ΔF was then plaque purified
by plating on 633 cells and virus stocks were prepared by freeze-thawing cell
pellets.

Ad5-pseudotype particle production

Particles with Ad5 fiber

Ad5-pseudotyped particles were generated by virus growth in 633 cells,
which express the wild type Ad5 fiber protein. Viral particles were isolated and
purified over CsCl gradients (Von Seggern et al. (1999) J. Virol. 73:1601-1608;
purified by centrifugation on preformed 15-40% CsCl gradients (111,000 x g for
three hours at 4°C)). For analysis of viral proteins, ten µg of the purified
particles were electrophoresed on 8-16% gradient gels and the protein
transferred to nylon membranes. The resulting blot was probed with rabbit
polyclonal antibodies raised against recombinant Ad37 fiber or Ad5 fiber or
penton base proteins expressed in baculovirus-infected cells.

Particles with Ad37 fiber

Cells from the Ad37 fiber producing cell line 705 were infected at
approximately 1000 particles/cell with Ad5.βgal.ΔF or with Ad5.GFP.ΔF. Viral
particles were isolated and purified over CsCl gradients. The bands were
harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% NaCl, and
10% glycerol), aliquoted and stored at -70°C.

Viral protein analyses 

For analysis of viral proteins, 10 µg of purified Ad5.βgal.ΔF particles with
no fiber (grown in 293 cells), the Ad5 fiber (grown in 633 cells), or the Ad37
fiber (grown in 705 cells) were electrophoresed by 8-16% polyacrylamide
gradient SDS-PAGE and the proteins were transferred to nylon membranes. The
blot was then probed with α-Ad37 fiber rabbit antibody. Ad5 fiber and penton
base were detected by reprobing the blot with polyclonal antibodies raised
against recombinant proteins expressed in baculovirus-infected cells (Wickham
et al. (1993) Cell 73:309-319).

Adenovirus infection and cell binding assays 

Adherent Chang C and A549 cells were infected with GFP expressing
Ad5 vectors containing the Ad5 fiber (Ad5.GFP.ΔF/5F) or the Ad37 fiber
(Ad5.GFP.ΔF/37F) at 10,000 particles per cell for 3 hours at 37°C, 5% CO2 in
DMEM, 10% FCS. Cells were washed twice with saline and then cultured
overnight at 37°C, 5% CO2. The next day, the cells were detached with buffer
containing 0.05% (w/v) trypsin and 0.5 mM EDTA (Boehringer Mannheim) for 5
minutes at 37°C. Suspended cells were washed once with PBS and then
resuspended in phosphate-buffered saline (PBS), pH 7.4. GFP fluorescence was
measured with a FACScan flow cytometer. A threshold established by the
fluorescence of uninfected cells was used to distinguish cells expressing GFP.

To assess the role of CAR in Ad infection, 10,000 attached cells were
pre-incubated with 180 µg/ml RmcB, a function-blocking anti-CAR monoclonal
antibody (Hsu et al. (1988) J. Virol. 62:1647-1652), in complete DMEM for 1
hour at 4°C. A small volume containing Ad5.GFP.ΔF/5F or Ad5.GFP.ΔF/37F
was then added at 10,000 particles per cell. The cells were infected for 3
hours, cultured overnight, harvested, and analyzed for GFP expression. Percent





cells expressing GFP was determined by the percent of cells detected above a 
threshold set by the fluorescence of uninfected Chang C cells. 
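The threshold-based scoring of GFP-positive cells described above can be sketched in a few lines of Python. This is only an illustration: the text does not state how the threshold was derived from the uninfected control, so the sketch assumes it is the maximum control intensity, and all intensity values are hypothetical.

```python
def percent_positive(intensities: list[float], threshold: float) -> float:
    """Percentage of cells whose fluorescence exceeds the threshold."""
    if not intensities:
        raise ValueError("no events to score")
    above = sum(1 for value in intensities if value > threshold)
    return 100.0 * above / len(intensities)

# Hypothetical per-cell fluorescence values (arbitrary units).
uninfected = [3, 5, 4, 6, 5]
infected = [4, 50, 7, 120, 5, 80, 6, 200]

# Assumption: the gate is set at the brightest uninfected cell.
threshold = max(uninfected)

pct_gfp = percent_positive(infected, threshold)
print(f"threshold = {threshold}, GFP-positive = {pct_gfp:.1f}%")
```

In practice the gate on a flow cytometer is usually drawn on a histogram or by percentile rather than at a strict maximum; the arithmetic of the percentage is the same.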

To measure adenovirus binding to cells, wild-type Ad37 was labeled with
¹²⁵I using Iodogen (Pierce) according to the manufacturer's instructions and
separated from free ¹²⁵I by gel filtration as described (Huang et al. (1999)
J. Virol. 73:2798-2802). Binding of radiolabeled wild-type Ad37 to Chang C
cells was then quantitated as described (Huang et al. (1999) J. Virol.
73:2798-2802). Non-specific binding was determined by incubating cells and
labeled Ad37 particles in the presence of a 100-fold excess of unlabeled Ad37.
Specific binding was calculated by subtracting the non-specific binding from
the total cpm bound. To examine whether divalent cations are required for
binding, 10 mM ethylenediaminetetraacetic acid (EDTA) or various
concentrations of CaCl2 or MgCl2 were added to cells before incubation with
labeled virus. To examine whether the receptor for Ad37 is a protein, cells
were pretreated with 10 µg/ml trypsin (GIBCO), subtilisin (Sigma), proteinase K
(Boehringer Mannheim), or bromelain (Sigma) at 37°C for 1 hour, then washed
twice with complete DMEM before adding labeled virus. Cells were >95% viable
after protease treatment.
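
As a worked illustration of the subtraction described above, the sketch below computes specific binding from replicate counts. The function name and the cpm values are hypothetical and are not data from these experiments; averaging replicates before subtracting is likewise an assumption.

```python
from statistics import mean

def specific_binding(total_cpm, nonspecific_cpm):
    """Mean total counts minus mean counts measured with excess unlabeled virus."""
    return mean(total_cpm) - mean(nonspecific_cpm)

# Hypothetical replicate counts (cpm).
total = [5200, 5000, 5100]        # labeled Ad37 alone
nonspecific = [600, 500, 700]     # labeled Ad37 plus 100-fold unlabeled Ad37

print(specific_binding(total, nonspecific))
```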

Ad37 binding to conjunctival cells is calcium-dependent. Specific binding of
¹²⁵I-labeled Ad37 to Chang C cells was measured in the presence of 10 mM
EDTA and in the presence of varying concentrations of calcium chloride or
magnesium chloride. Specific binding was determined by subtracting the
nonspecific counts in the presence of 100-fold excess unlabeled virus from the
total counts.

Pretreatment of conjunctival cells with proteases inhibits Ad37 binding.
Chang C cells were pretreated with various proteases for 1 hour before binding
¹²⁵I-labeled Ad37 to the cells. Nonspecific binding was measured by adding a
100-fold excess of unlabeled Ad37 to cells with ¹²⁵I-labeled Ad37 and was
subtracted from total counts to give specific binding. Percent inhibition
represents the difference in specific binding of untreated cells and pretreated
cells as a percentage of the specific binding of untreated cells.
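
The percent inhibition defined above can be written out directly. The sketch below is illustrative only: the function name and the specific-binding values are hypothetical, and the guard against a non-positive untreated value is an added assumption.

```python
def percent_inhibition(untreated_specific, pretreated_specific):
    """(untreated - pretreated) as a percentage of untreated specific binding."""
    if untreated_specific <= 0:
        raise ValueError("untreated specific binding must be positive")
    difference = untreated_specific - pretreated_specific
    return 100.0 * difference / untreated_specific

# Hypothetical specific binding (cpm) without and with protease pretreatment.
print(percent_inhibition(4500, 1800))
```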
Virus overlay protein blot assay (VOPBA)

For VOPBA of human conjunctival membrane proteins probed with Ad37
in the presence of EDTA or calcium chloride, Chang C membrane fractions were
separated by 8% SDS-PAGE and transferred to a PVDF membrane. The
membrane was subsequently probed with or without whole Ad37 particles, a
polyclonal antibody against Ad37 fiber, and finally a horseradish
peroxidase-conjugated anti-rabbit antibody, in the presence of EDTA or calcium
chloride. Transferred Chang C membrane proteins were also probed with
recombinant Ad37 knob protein, instead of whole Ad37 particles, in the presence
of calcium chloride.

Confluent monolayers of Chang C and A549 cells were detached by
scraping, pelleted by centrifugation, and then resuspended in 250 mM sucrose,
20 mM HEPES, pH 7.0, 1 mM EDTA, and 2 µg/ml aprotinin and leupeptin. Cells
were transferred into a Dounce homogenizer and disrupted with 30 strokes.
Organelles and nuclei were pelleted at 500 x g for 15 min. Plasma membrane
fragments were then pelleted from the supernatant of cell lysates at
200,000 x g for 1 hour and then resuspended in 10 mM Tris-Cl, pH 8.1,
10 µg/ml aprotinin and leupeptin.

Cell membranes of Chang C or A549 cells were incubated (1:1) with a
2% SDS, non-reducing buffer and separated on an 8% polyacrylamide gel
without boiling. Membrane proteins were then electroblotted onto a PVDF
membrane (Immobilon-P) and blocked in 5% (w/v) milk in PBS, pH 7.4, 0.02%
Tween-20 (PBS-T). After blocking, the membrane was incubated with 1 µg/ml
wild-type Ad19p or Ad37 in 0.5% (w/v) milk in PBS-T, 1 mM CaCl2, for 1 hour
at room temperature. The membrane was then washed once with
phosphate-buffered saline, pH 7.4 (PBS), 1 mM CaCl2, and incubated with a
1:500 dilution of α-Ad37 fiber rabbit antibody in 0.5% (w/v) milk in PBS-T,
1 mM CaCl2, for 30 minutes at room temperature. The membrane was washed again
with PBS, 1 mM CaCl2, and incubated with a 1:5000 dilution of horseradish
peroxidase (HRP)-conjugated α-rabbit antibody (Sigma) in 0.5% (w/v) milk in
PBS-T, 1 mM CaCl2, for 30 minutes at room temperature. The membrane was washed
four times in PBS, 1 mM CaCl2, once with PBS-T, 1 mM CaCl2, and once in 1 mM
CaCl2. The blot was developed with enhanced chemiluminescence reagents (Pierce)
for 5 minutes and placed onto a piece of Biomax film (Kodak) for 5 seconds to 1
minute. For divalent metal cation experiments, membranes were incubated in
the presence of 2 mM EDTA instead of 1 mM CaCl2 in all solutions. To assay
fiber knob binding to cell membrane proteins, membrane filters were incubated
with 1 µg/ml purified T7-tagged Ad37 knob protein in Tris-buffered saline, 0.1%
Tween-20, 1 mM CaCl2, for 1 hour at room temperature. α-Ad37 fiber rabbit
antibody and HRP-conjugated anti-rabbit antibody were applied and the
membrane was developed with substrate solution as described above.

Results: Comparison of adenovirus infection of human conjunctival and lung
epithelial cells with virus particles retargeted with Ad5 or Ad37 fiber proteins

Packaging cell lines producing the Ad37 fiber protein were generated.
Since the N-terminal amino acid sequences of the Ad5 and Ad37 fiber proteins
differ significantly, and to ensure that the Ad37 fiber would be efficiently
incorporated into Ad5 vector particles, several residues in the wild-type Ad37
fiber were mutated to more closely match the Ad5 sequence. Stable cell lines
producing this fiber under control of the CMV promoter and the adenovirus type
5 tripartite leader were then generated and screened for fiber expression by
indirect immunofluorescence. One clone (line 705), which expressed the Ad37
fiber at a high level, was selected for further study.

Cells from line 633, which expresses the wild-type Ad5 fiber
protein, and from line 705 were infected with a fiber-deleted Ad5 vector
carrying a β-galactosidase reporter gene. The resulting vector particles
contained the Ad5 fiber protein (Ad5.βgal.ΔF/5F) and the Ad37 fiber protein
(Ad5.βgal.ΔF/37F), respectively. Incorporation of the correct fiber protein
into viral particles was verified by Western blotting. Adenoviral vectors
containing the GFP reporter gene, Ad5.GFP.ΔF/5F and Ad5.GFP.ΔF/37F, were
created in the same fashion.

Infection of a variety of cell types using the retargeted adenovirus
particles was examined. As assayed by GFP fluorescence, Ad5.GFP.ΔF/5F
exhibited good gene delivery to lung epithelial (A549) and conjunctival
(Chang C) cells. In contrast, Ad5.GFP.ΔF/37F efficiently delivered GFP to
Chang C cells, but exhibited very poor gene delivery to A549 cells. Although
CAR is expressed on the surface of A549 cells, as indicated by Ad5.GFP.ΔF/5F
infection, Ad5.GFP.ΔF/37F was unable to infect these cells efficiently. This
experiment shows that the Ad37 fiber protein can confer preferential infection
of human conjunctival cells, but not of CAR-expressing human lung epithelial
cells.

Hence, CAR is not the primary receptor for Ad37. Recent studies reported
that expression of CAR on the surface of Chinese hamster ovary (CHO) cells did
not improve Ad37 binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying
that Ad37 does not use CAR as a primary receptor. To verify this on
human conjunctival cells, A549 and Chang C cells were pretreated with RmcB
(Hsu et al. (1988) J. Virol. 62:1647-1652), a function-blocking monoclonal
antibody against CAR. The RmcB antibody inhibited infection of A549 cells by
Ad5.GFP.ΔF/5F, but it had little effect on infection of Chang C cells by
Ad5.GFP.ΔF/37F. This indicates that CAR is not the primary receptor for Ad37
on Chang C conjunctival cells.

Ad37 binding to conjunctival cells requires divalent metal cations. It has
been proposed (Roelvink et al. (1998) J. Virol. 72:7909-7915) that a
combination of fiber binding to CAR and penton base binding to αv-integrins
allows some adenovirus serotypes to attach to cells. Although αv-integrin
binding to the RGD motif of the adenovirus penton base is of relatively low
affinity (Wickham et al. (1993) Cell 73:309-319), it may nonetheless contribute
to viral attachment to the cell surface. Ad37 shows a particularly strong
affinity for binding to integrin αvβ5 (Mathias et al. (1998) J. Virol.
72:8669-8675), suggesting that integrin αvβ5 might be a primary receptor for
Ad37. Binding of the RGD motif by αv-integrins requires the presence of
divalent cations, such as calcium or magnesium (Stuiver et al. (1996) J. Cell
Physiol. 168:521-531). In contrast, no divalent cations were required for
binding in the CAR-Ad12 knob complex (Bewley et al. (1999) Science
286:1579-1583).

To investigate the potential role of αv-integrins and divalent metal cations
in Ad37 receptor binding, ¹²⁵I-labeled Ad37 binding to Chang C cells was
examined in the absence or presence of EDTA. EDTA inhibited Ad37 binding to
conjunctival cells but did not alter Ad5 binding. These findings suggest a
requirement for divalent metals for Ad37 binding.
The presence of either calcium or magnesium ions helps αvβ5 organize in focal
contacts (Stuiver et al. (1996) J. Cell Physiol. 168:521-531), suggesting that
calcium and magnesium aid in integrin αvβ5 function. To further test the
potential role of integrin αvβ5 in Ad37 cell attachment, ¹²⁵I-labeled Ad37
binding to Chang C cells was measured in the presence of varying concentrations
of calcium or magnesium chloride. Magnesium ions had little effect on Ad37
binding to Chang C cells. In contrast, calcium ions dramatically enhanced Ad37
binding to Chang C cells. The optimal concentration of calcium chloride for
Ad37 binding was 1 mM, while higher concentrations of calcium actually
decreased virus binding to cells. The fact that calcium, but not magnesium,
promoted Ad37 attachment is not consistent with integrin αvβ5 as the primary
receptor for viral attachment to the cells, since either metal will support
ligand binding to integrin αvβ5. Moreover, A549 cells express abundant
αv-integrins (Mathias et al. (1998) J. Virol. 72:8669-8675) but were unable to
support efficient binding of Ad37.

Wild-type Ad37 particles bind to three conjunctival membrane proteins.
Recent studies reported that protease treatment of CHO cells abolished Ad37
binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying that Ad37 bound to
a protein receptor on CHO cells. Scatchard analysis of Ad37 binding to Chang C
cells showed that each cell expresses approximately 24,000 fiber binding sites
(Huang et al. (1999) J. Virol. 73:2798-2802). To determine whether the Ad37
binding site on human conjunctival cells is also a protein, Chang C cells were
treated with different proteases prior to measuring binding of ¹²⁵I-labeled
Ad37. Digestion of surface proteins by all four proteases inhibited Ad37
binding to Chang C cells by greater than 50%. This finding showed that Ad37
also binds to a protein receptor on Chang C cells.
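
For readers unfamiliar with Scatchard analysis, the sketch below shows how a number of binding sites per cell can be read from such a plot. It is not the analysis performed in the cited study. It assumes a simple one-site model, generates synthetic data from it, and then recovers the parameters with an ordinary least-squares line through (bound, bound/free). Only the figure of 24,000 sites per cell comes from the text; the Kd value, its units, and all names are hypothetical.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free); returns (Kd, Bmax)."""
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    kd = -1.0 / slope           # slope of a Scatchard plot is -1/Kd
    bmax = -intercept / slope   # x-intercept is the total number of sites
    return kd, bmax

# Synthetic one-site data. Bmax is the value cited in the text (sites per cell);
# Kd is a hypothetical value in arbitrary concentration units.
TRUE_BMAX = 24000.0
TRUE_KD = 1.0
free = [0.1, 0.3, 1.0, 3.0, 10.0]
bound = [TRUE_BMAX * f / (TRUE_KD + f) for f in free]

kd, bmax = scatchard_fit(bound, free)
print(round(kd, 3), round(bmax))
```

With real measurements the points scatter around the line, and a curved plot would suggest more than one class of binding site, so the linear fit is only appropriate when the one-site assumption holds.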

Virus overlay protein blot assays (VOPBAs) were used to identify
candidate viral protein receptors. This Western blot technique uses intact viral
particles in place of antibodies to probe virus-receptor interactions. VOPBA was
used herein to identify Chang C membrane proteins that bind to Ad37. In the
absence of Ad37 particles, no protein bands were observed, while addition of
virus in the absence of calcium revealed binding to a single 45 kDa protein. In
the presence of 1 mM calcium chloride, Ad37 reacted with three proteins with
approximate molecular weights of 45, 50 and 60 kDa. The same three proteins
were detected using a recombinant Ad37 fiber knob alone, indicating that
Ad37-receptor interactions are fiber mediated and do not involve interactions
of other capsid proteins such as the penton base. The size of the
calcium-independent protein (45 kDa) is very similar to the known molecular
weight of CAR. A direct comparison of the Ad37 VOPBA and a CAR Western blot
showed that the 45 kDa receptor co-migrates with CAR on SDS-PAGE. Moreover,
two other members of subgroup D adenoviruses, Ad9 and Ad15, have been shown to
bind to CAR (Roelvink et al. (1998) J. Virol. 72:7909-7915).

Since CAR does not appear to mediate Ad37 binding on intact Chang C
cells, the possibility that the 50 or 60 kDa protein serves this function was
tested by examining an adenovirus serotype that does not bind to Chang C cells.
Ad19p, a closely related subgroup D adenovirus, binds poorly to Chang C cells
(Huang et al. (1999) J. Virol. 73:2798-2802) and Ad19p recognition of the
Ad37 receptor is therefore unlikely. Ad19p particles bound to the 45 and 60
kDa receptors in the VOPBA, but did not bind to the 50 kDa receptor.
Moreover, the 50 kDa receptor is expressed on Chang C cells, but not A549
cells, which only support low levels of Ad37 binding and infection. Taken
together, these data indicate that the 50 kDa protein is a primary candidate
receptor for Ad37 on human conjunctival cells.

Discussion 

The identification of the CAR protein as a major adenovirus receptor does
not explain why certain subgroup D members, such as Ad37, preferentially
infect ocular cells. A 50 kDa human conjunctival cell membrane protein is
identified herein as a primary candidate for the receptor for Ad37. This 50 kDa
protein is not present on A549 lung epithelial cells. Ad37 binding to this
receptor is calcium-dependent, which is consistent with Ad37 binding and
infection experiments. Ad37 also bound to a 60 kDa protein that is present on
human conjunctival and lung epithelial cells. It does not, however, appear to be
serotype specific. The molecular weights of MHC class I heavy chain, which has
been proposed as a receptor for Ad5, and αvβ3 and αvβ5 integrins, receptors for
the penton base, are distinct from the 50 or 60 kDa receptor characterized in
this study.

The studies of Ad37-receptor interaction using VOPBAs are consistent
with previous studies showing that subgroup D adenoviruses can bind to the
extracellular domain of CAR (Roelvink et al. (1998) J. Virol. 72:7909-7915).
Biochemical and structural studies on knob-CAR interactions indicate that the
CAR binding site is located on the AB-loop of the fiber knob. Alignment of the
fiber sequences of Ad37 and other adenoviruses reveals that the AB-loop of
Ad37 is similar to those of Ad12 and Ad5. Moreover, a phylogenetic tree of
adenovirus knobs (Roelvink et al. (1998) J. Virol. 72:7909-7915) shows that
fiber proteins of subgroup D are similar to those of subgroups C and E, which
use CAR as their primary receptor. Ad37 does not, however, appear to
effectively use CAR as a primary receptor, as demonstrated by virus binding and
infection studies on Chang C conjunctival cells and A549 lung epithelial cells.

It has been reported that Ad37 uses sialic acid as a receptor on Chinese
hamster ovary (CHO) cells and human lung carcinoma (A549) cells (Arnberg et
al. (2000) J. Virol. 74:42-48). Human conjunctival cells were not studied.
Human corneal epithelial (HCE) cells were the only ocular cell line studied and
Ad37 binds relatively poorly to these cells, compared to binding on A549 cells
(Arnberg et al. (2000) J. Virol. 74:42-48). In addition, 8.4 × 10⁷ wheat germ
agglutinin molecules per cell were required to significantly inhibit Ad37
binding to sialic acid on sialic acid-positive CHO cells (Arnberg et al. (2000)
J. Virol. 74:42-48), three orders of magnitude higher than the number of Ad37
receptors on Chang C conjunctival cells (Huang et al. (1999) J. Virol.
73:2798-2802). Clearly, sialic acid is not the only factor responsible for Ad37
binding to the cell surface and its influence on Ad37 tropism is unclear.
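
The "three orders of magnitude" comparison above can be checked with a short calculation. The sketch below simply divides the two per-cell figures quoted in the text; the variable names are illustrative.

```python
import math

WGA_MOLECULES_PER_CELL = 8.4e7   # Arnberg et al. (2000), as cited in the text
AD37_SITES_PER_CELL = 24_000     # Huang et al. (1999), as cited in the text

ratio = WGA_MOLECULES_PER_CELL / AD37_SITES_PER_CELL
orders_of_magnitude = math.floor(math.log10(ratio))

print(ratio, orders_of_magnitude)
```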

The results herein show that Ad37 selects a 50 kDa cellular receptor for
binding to conjunctival cells, but it is possible that sialic acid also plays a
role in this interaction. The characterization and identification of the Ad37
receptor have therapeutic implications and also explain the different tropism
of Ad37. The 50 kDa receptor for Ad37 may also be the receptor for other
subgroup D adenoviruses that cause severe cases of EKC, Ad19a and Ad8. Ad19p is
a nonpathogenic variant of Ad19 (Arnberg et al. (1998) Virology 227:239-244)
while Ad19a, along with Ad8 and Ad37, is a major cause of EKC. Ad19a and
Ad37 have identical fiber proteins (Arnberg et al. (1998) Virology 227:239-244)
and have similar tropism in vivo. Ad8, Ad19a, and Ad37 agglutinate dog and
guinea pig erythrocytes more effectively than four other serotypes that are
associated with less severe forms of conjunctivitis (Arnberg et al. (1998)
Virology 227:239-244), implying that the receptors of Ad8, Ad19a, and Ad37
have similar characteristics. Hence, this 50 kDa receptor is an attractive drug
target against EKC caused by adenoviruses, to provide therapeutic intervention
in ocular diseases associated with these viruses.

EXAMPLE 10 

Targeting of the Ad5 vector to photoreceptor cells 

The fiber-deleted adenovirus vector Ad5.GFP.ΔF was propagated in 705
cells, which express a modified Ad37 fiber protein. Viral particles
(Ad5.GFP.ΔF/37F) were harvested, CsCl-purified and dialyzed into 0.9% NaCl, 10
mM Tris, pH 8.1, and 10% glycerol. Two to three µl of the resulting solution,
containing approximately 1x10 s particles/µl, were injected into the vitreous
chamber of a mouse eye. Seven days post-injection, eyes were harvested, fixed
with paraformaldehyde and cryo-sectioned. Sections were stained with an
anti-rhodopsin antibody to identify photoreceptor cells and with DAPI to show
all cell nuclei. The resulting sections showed red anti-rhodopsin staining in
the photoreceptors, blue DAPI-stained nuclei, and green GFP staining in any
transduced cells. The results revealed substantially exclusive transduction of
photoreceptors. Co-localization of rhodopsin staining and GFP expression
indicated selective transduction of photoreceptor cells.

As a control, contralateral eyes were injected with a stock of the
fiber-deleted vector Ad5.βgal.ΔF grown in the same Ad37 fiber-expressing cells.
Since this virus (Ad5.βgal.ΔF/37F) produces βgal rather than GFP, the green
staining is absent from the photoreceptors.

Additional experiments using the Ad37 fiber for targeting to the
photoreceptor cells have been performed. Subretinal and intravitreal injection
have been used in mouse models and the results demonstrate targeting to the
photoreceptors. As with intravitreally injected eyes, the major cell type
infected via subretinal administration was the photoreceptor.

As noted, Ad5.GFP.ΔF/37F infected Chang C cells efficiently, but A549
cells poorly. Ad37 fiber protein confers preferential infection on human
conjunctival cells, but not CAR-expressing human lung epithelial cells. Binding
to conjunctival cells requires divalent cations.

Since modifications will be apparent to those of skill in this art, it is
intended that this invention be limited only by the scope of the appended claims.
WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule, comprising:
adenovirus inverted terminal repeat sequences; an adenovirus packaging
signal operatively linked thereto; and a photoreceptor-specific promoter.

2. The isolated nucleic acid molecule of claim 1, further comprising a
nucleic acid encoding a therapeutic product operatively linked to the promoter.

3. The isolated nucleic acid molecule of claim 1, wherein the
promoter is a rhodopsin promoter.

4. The nucleic acid molecule of claim 1, wherein the adenovirus
genome does not encode a functional fiber protein such that packaging the
nucleic acid requires complementation in a packaging cell.

5. A recombinant adenovirus vector, comprising the nucleic acid
molecule of any of claims 1-4 packaged therein.

6. A recombinant adenovirus vector of claim 5, wherein inverted
terminal repeat sequences (ITR) and a packaging signal are derived from
adenovirus type 2 or adenovirus type 5.

7. A recombinant adenovirus vector of claim 5, wherein the virus
comprises a fiber protein.

8. A recombinant adenovirus vector of claim 7, wherein the fiber
protein selectively binds to photoreceptors in the eye of a mammal.

9. A recombinant adenovirus vector of claim 7, wherein the fiber is a
chimera composed of N-terminal sequences from adenovirus type 2 or type 5,
and a sufficient portion of an adenovirus serotype D fiber for selective binding to
photoreceptors in the eye of a mammal.

10. A method for targeted delivery of a gene product to the eye of a
mammal, comprising:
administering a recombinant adenovirus virus that comprises heterologous
DNA encoding the gene product or resulting in expression of the gene product,
wherein the recombinant virus comprises a fiber protein that specifically or
selectively binds to receptors that are expressed on cells in the eye.
11. The method of claim 10, wherein the cells are photoreceptors.

12. The method of claim 10, wherein administration is effected by
intraocular delivery.

13. The method of claim 10, wherein administration is effected by a
method selected from subretinal injection, intravenous administration, periorbital
administration, and intravitreal administration.

14. The method of claim 10, wherein the recombinant virus comprises
a fiber protein from an adenovirus type D serotype.

15. The method of any of claims 10-14, wherein the fiber protein is an
adenovirus type 37.

16. The method of any of claims 10-14, wherein the fiber is a chimeric
protein containing a sufficient portion of the N-terminus of an adenovirus type 2
or type 5 fiber protein for interaction with an adenovirus type 2 or type 5
penton, and a sufficient portion of an adenovirus serotype D knob portion of the
fiber for selective binding to photoreceptors in the eye of a mammal.

17. The method of any of claims 10-16, wherein the recombinant virus
is an adenovirus type D serotype.

18. The method of any of claims 10-17, wherein the encapsulated
nucleic acid comprises a photoreceptor-specific promoter operatively linked to a
nucleic acid comprising the therapeutic product.

19. The method of claim 18, wherein the therapeutic product is
selected from the group consisting of a trophic factor, an anti-apoptotic factor, a
gene encoding a rhodopsin protein, a wild-type Stargardt disease gene (STDG1),
an anti-cancer agent and a protein that regulates expression of a
photoreceptor-specific gene product.

20. The method of any of claims 10-19, wherein delivery is effected
for treatment of an ocular disease.

21. The method of claim 20, wherein the disorder is a retinal
degenerative disease.

22. The method of claim 20, wherein the disease is retinitis
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, or
retinoblastoma.
23. The method of any of claims 10-22, wherein the mammal is a
human.

24. The method of any of claims 10-22, wherein the viral nucleic acid
comprises:
an adenovirus inverted terminal repeat (ITR) sequences; and
an adenovirus packaging signal operatively linked thereto.

25. The method of claim 24, wherein the ITRs and packaging signal
are derived from an adenovirus serotype B or C.

26. The method of claim 24, wherein the ITRs and packaging signal
are derived from an adenovirus type 2 or 5.

27. The method of claim 24, wherein the viral nucleic acid further
comprises a photoreceptor-specific promoter.

28. A method of targeted gene therapy, comprising:
administering a recombinant viral vector that comprises an adenovirus
type 37 fiber protein or portion thereof, whereby the vector selectively
transduces photoreceptors and delivers a gene product encoded by the
recombinant viral vector; wherein the portion is sufficient for selective binding to
photoreceptors.

29. The method of claim 28, wherein the vector is administered into
the eye.

30. The method of claim 28, wherein the vector is administered to the
vitreous cavity of the eye.

31. The method of claim 28, wherein administration is effected by
subretinal injection, intravenous administration, periorbital administration or
intravitreal administration.

32. The method of any of claims 10-31, wherein at least about 10
plaque forming units of virus are administered.

33. The method of any of claims 10-31, wherein about 1 plaque
forming unit to about 10¹⁴ plaque forming units of virus are administered.
SEQUENCE LISTING

<110> VON SEGGERN, DANIEL
      NEMEROW, GLEN R.
      FRIEDLANDER, MARTIN

<120> VECTORS FOR OCULAR TRANSDUCTION AND USE THEREFOR FOR GENETIC THERAPY

<130> 7S6.1PCT/NOV0205P

<140>
<141> 2001-05-01

<150> 09/562,934
<151> 2000-05-01

<160> 51

<170> PatentIn Ver. 2.1

<210> 1
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: primer

<400> 1
cggtacacag aattcaggag acacaactcc                                   30

<210> 2
<211> 35
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer

<400> 2
gcctggatcc gggaagttac gtaacgtggg aaaac                             35

<210> 3
<211> 12
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: linker

<400> 3
cgcggatccg cg                                                      12
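
The short entries above can be sanity-checked mechanically. The sketch below compares each sequence with its declared <211> length and reports whether it contains the standard EcoRI (GAATTC) or BamHI (GGATCC) recognition sequence. The listing itself does not state which restriction sites the primers and linker were designed around, so the site check is an inference offered for illustration, and all names in the code are hypothetical.

```python
import re

# Sequences and declared lengths as given for SEQ ID NOs 1-3.
ENTRIES = {
    1: ("cggtacacag aattcaggag acacaactcc", 30),
    2: ("gcctggatcc gggaagttac gtaacgtggg aaaac", 35),
    3: ("cgcggatccg cg", 12),
}

# Standard recognition sequences of two common restriction enzymes.
SITES = {"EcoRI": "gaattc", "BamHI": "ggatcc"}

def check_entry(raw, declared_length):
    """Return (length matches declaration, sorted names of sites found)."""
    seq = re.sub(r"\s+", "", raw).lower()
    found = sorted(name for name, site in SITES.items() if site in seq)
    return len(seq) == declared_length, found

report = {seq_id: check_entry(raw, n) for seq_id, (raw, n) in ENTRIES.items()}
for seq_id, (length_ok, sites) in report.items():
    print(seq_id, length_ok, sites)
```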



<210> 4
<211> 8710
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: plasmid



<400> 4 
cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 60
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 120
cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga 180
ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 240
accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg 300
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 360
gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac 420
caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct 480
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 540
agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 600
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg 660
gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcagga gacacaactc 720
caagtgcata ctctatgtca ttttcatggg actggtctgg ccacaactac attaatgaaa 780
tatttgccac atcctcttac actttttcat acattgccca agaataaaga atcgtttgtg 840
ttatgtttca acgtgtttat ttttcaattg cagaaaattt caagtcattt ttcattcagt 900
agtatagccc caccaccaca tagcttatac agatcaccgt accttaatca aactcacaga 960
accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt cctttctccc 1020
cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg tgttatattc 1080
cacacggttt cctgtcgagc caaacgctca tcagtgatat taataaactc cccgggcagc 1140
tcacttaagt tcatgtcgct gtccagctgc tgagccacag gctgctgtcc aacttgcggt 1200
tgcttaacgg gcggcgaagg agaagtccac gcctacatgg gggtagagtc ataatcgtgc 1260
atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc 1320
gtcctgcagg aatacaacat ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc 1380
ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa atcagcacag 1440
taactgcagc acagcaccac aatattgttc aaaatcccac agtgcaaggc gctgtatcca 1500
aagctcatgg cggggaccac agaacccacg tggccatcat accacaagcg caggtagatt 1560
aagtggcgac ccctcataaa cacgctggac ataaacatta cctcttttgg catgttgtaa 1620
ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc caccaccatc 1680
ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc gggactggaa 1740
caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt catgatatca 1800
atgttggcac aacacaggca cacgtgcata cacttcctca ggattacaag ctcctcccgc 1860
gttagaacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc cacactgcag 1920
ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc gggcagcagc 1980
ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag acgatcccta 2040
ctgtacggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat gccaaatgga 2100
acgccggacg tagtcatatt tcctgaagca aaaccaggtg cgggcgtgac aaacagatct 2160
gcgtctccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata tccactctct 2220
caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat gcgccgctgc 2280
cctgataaca tccaccaccg cagaataagc cacacccagc caacctacac attcgttctg 2340
cgagtcacac acgggaggag cgggaagagc tggaagaacc atgttttttt ttttattcca 2400
aaagattatc caaaacctca aaatgaagat ctattaagtg aacgcgctcc cctccggtgg 2460
cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt tgcacaatgg 2520
cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac ccttcagggt 2580
gaatctcctc tataaacatt ccagcacctt caaccatgcc caaataattc tcatctcgcc 2640
accttctcaa tatatctcta agcaaatccc gaatattaag tccggccatt gtaaaaatct 2700
gctccagagc gccctccacc ttcagcctca agcagcgaat catgattgca aaaattcagg 2760
ttcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac cgcgatcccg 2820
taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga ccagcgcggc 2880
cacttccccg ccaggaacct tgacaaaaga acccacactg attatgacac gcatactcgg 2940
agctatgcta accagcgtag ccccgatgta agctttgttg catgggcggc gatataaaat 3000
gcaaggtgct gctcaaaaaa tcaggcaaag cctcgcgcaa aaaagaaagc acatcgtagt 3060
catgctcatg cagataaagg caggtaagct ccggaaccac cacagaaaaa gacaccattt 3120
ttctctcaaa catgtctgcg ggtttctgca taaacacaaa ataaaataac aaaaaaacat 3180
ttaaacatta gaagcctgtc ttacaacagg aaaaacaacc cttataagca taagacggac 3240
tacggccatg ccggcgtgac cgtaaaaaaa ctggtcaccg tgattaaaaa gcaccaccga 3300
cagctcctcg gtcatgtccg gagtcataat gtaagactcg gtaaacacat caggttgatt 3360
catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc cgcaggcgta 3420
gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag aaaaacacat 3480
aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc agaacaacat 3540
acagcgcttc
aaaaaacacc 
gcagagcgag 
ccagaaaacc 
caaatcgtca 
tctccgcaag 
ccggcttcca 
cagacaaggt 
gcggcataaa 
gcgagcgatc 
tgcaacgcgg 
cagcctcgcg 
ttcatccccg 
catgtcttta 
aacacgcaga 
cgtgtggcct 
ccgcagatcc 
agtttctgat 
aatctcgtgc 
gcgccgatgg 
cgattccgga 
gccgtgcaca 
agccggtcgc 
tcggcccatt 
cgattgctga 
ccgtcgcgca 
acctcgtgca 
cggtcattga 
tcttctggag 
atccggagct 
aactctatca 
gcgacgcaat 
gcgcggccgt 
ccagcactcg 
aggagacaat 
acgggtgttg 
gataccccac 
caccccccaa 
gccatagcca 
tcgtgggggt 
cagacccatg 
ccgggcgtct 
cgcccagtgc 
gccctgacgg 
gagctgcatg 
taatcagcca 
cctgaacctg 
taatggttac 
gcattctagt 
gttctagagc 
atttcgagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgetetG 
ggaagcgtgg 
cgctccaagc 



acagcggcag 
actcgacacg 
tatatatagg 
gcacgcgaac 
cttccgtttt 
aattgattgg 
ttcaggtcga 
atagggcggc 
tcgccgtgac 
cttgaagctg 
gcatcccgat 
tcgcgaacgc 
tggcccgttg 
gttctatgat 
tgcagtcggg 
cgaacaccga 
cgggcaatga 
cgaaaagttc 
tttcagcttc 
tttctacaaa 
agtgcttgac 
gggtgteacg 
ggaggccatg 
cggaccgcaa 
tccccatgtg 
ggctctcgat 
cgcggatttc 
ctggagcgag 
gccgtggttg 
tgcaggatcg 
gagcttggtt 
cgtccgatcc 
ctggaccgat 
tccgagggca 
accggaagga 
ggtcgtttgt 
cgagacccca 
gttcgggtga 
ctggccccgt 
tattattttg 
gtttttggat 
gtggctgcca 
cgtcgaccgg 
gcttgtctgc 
tgtcagaggt 
taccacattt 
aaacataaaa 
aaataaagca 
tgtggtttgt 
ggccgccacc 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 



cctaacagtc 
gcaccagctc 
actaaaaaat 
ctacgcccag 
cccacgttac 
ctccaattct 
ggtggcccgg 
gcctacaatc 
gatcagcggt 
tccctgatgg 
gccgccggaa 
cagcaagacg 
ctcgcgtttg 
gacacaaacc 
gcggcgcggt 
gcgaccctgc 
gatatgaaaa 
gacagcgtct 
gatgtaggag 
gatcgttatg 
attggggaat 
ttgcaagacc 
gatgcgatcg 
ggaatcggtc 
tatcactggc 
gagctgatgc 
ggctccaaca 
gcgatgttcg 
gcttgtatgg 
ccgcggctcc 
gacggcaatt 
ggagccggga 
ggctgtgtag 
aaggaatagg 
acccgcgcta 
tcataaacgc 
ttggggccaa 
aggcccaggg 
gggttaggga 
ggcgttgcgt 
ggcctgggca 
aacacccccg 
tcatggctgc 
tcccggcatc 
tttcaccgtc 
gtagaggttt 
tgaatgcaat 
atagcatcac 
ccaaactcat 
gcggtggagc 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
tagctcacgc 
gcacgaaccc 



agccttacca 
aatcagtcac 
gacgtaacgg 
aaacgaaagc 
gtaacttccc 
tggagtggtg 
ctccatgcac 
catgccaacc 
ccagtgatcg 
tcgtcatcta 
gcgagaagaa 
tagcccagcg 
ctggcggtgt 
ccgcccagcg 
cccaggtcca 
agcgacccgc 
agcctgaact 
ccgacctgat 
ggcgtggata 
tttatcggca 
tcagcgagag 
tgcctgaaac 
ctgcggccga 
aatacactac 
a^actgtgat 
tttgggccga 
atgtcctgac 
gggattccca 
agcagcagac 
gggcgtatat 
tcgatgatgc 
ctgtcgggcg 
aagtactcgc 
ggagatgggg 
tgacggcaat 
ggggttcggt 
tacgcccgcg 
ctcgcagcca 
cggggtcccc 
ggggtctggt 
tggaccgcat 
acccccaaaa 
gccccgacac 
cgcttacaga 
atcaccgaaa 
tacttgcttt 
tgttgttgtt 
aaatttcaca 
caatgtatct 
tccagctttt 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 



gtaaaaaaga 
agtgtaaaaa 
ttaaagtcca 
caaaaaaccc 
ggatccgcgg 
aatccgttag 
cgcgacgcaa 
cgttccatgt 
aagttaggct 
cctgcctgga 
tcataatggg 
cgtcggccgc 
ccccggaaga 
tcttgtcatt 
cttcgcatat 
ttaacagcgt 
caccgcgacg 
gcagctctcg 
tgtcctgcgg 
ctttgcatcg 
cctgacctat 
cgaactgccc 
tcttagccag 
atggcgtgat 
ggacgacacc 
ggactgcccc 
ggacaatggc 
atacgaggtc 
gcgctacttc 
gctccgcatt 
agcttgggcg 
tacacaaatc 
cgatagtgga 
gaggctaact 
aaaaagacag 
cccagggctg 
tttcttcctt 
acgtcggggc 
catggggaat 
ccacgactgg 
gtactggcgc 
accaccgcgc 
ccgccaacac 
caagctgtga 
cgcgcgaggc 
aaaaaacctc 
aacttgttta 
aataaagcat 
tatcatgtct 
gttcccttta 
tgtgaaattg 
aagcctgggg 
ctttccagtc 
gaggcggttt 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 
ccgaccgctg 



aaacctatta 3600 
agggccaagt 3660 
caaaaaacac 3720 
acaacttcct 3780 
cattcacagt 3840 
cgaggtgccg 3900 
cgcggggagg 3960 
gctcgccgag 4020 
ggtaagagcc 4080 
cagcatggcc 4140 
gaaggccatc 4200 
catgccctgc 4260 
aatatatttg 4320 
ggcgaattcg 4380 
taaggtgacg 4440 
caacagcgtg 4500 
tctgtcgaga 4560 
gagggegaag 4620 
gtaaatagct 4680 
gccgcgctcc 4740 
tgcatctccc 4800 
gctgttctgc 4860 
acgagcgggt 4920 
ttcatatgcg 4980 
gtcagtgcgt 5040 
gaagtccggc 5100 
cgcataacag 5160 
gccaacatct 5220 
gagcggaggc 5280 
ggtcttgacc 5340 
cagggtcgat 5400 
gcccgcagaa 5460 
aaccgacgcc 5520 
gaaacacgga 5580 
aataaaacgc 5640 
gcactctgtc 5700 
ttccccaccc 5760 
ggcaggccct 5820 
ggtttatggt 5880 
actgagcaga 5940 
gacacgaaca 6000 
ggatttctgg 6060 
ccgctgacgc 6120 
ccgtctccgg 6180 
agccggatca 6240 
cccacctccc 6300 
ttgcagctta 6360 
ttttttcact 6420 
ggatccacta 6480 
gtgagggtta 6540 
ttatccgctc 6600 
tgcctaatga 6660 
gggaaacctg 6720 
gcgtattggg 6780 
gcggcgagcg 6840 
taacgcagga 6900 
cgcgttgctg 6960 
ctcaagtcag 7020 
aagctccctc 7080 
tctcccttcg 7140 
gtaggtcgtt 7200 
cgccttatcc 7260 
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 7320
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 7380
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 7440
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 7500
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 7560
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 7620
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 7680
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 7740
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 7800
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 7860
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 7920
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 7980
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 8040
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 8100
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 8160
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 8220
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 8280
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 8340
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 8400
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 8460
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 8520
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 8580
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 8640
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 8700
cgaaaagtgc 8710



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: primer

<400> 5

atgggatcca agatgaagcg cgcaagaccg 30



<210> 6
<211> 30
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: primer

<400> 6

cataacgcgg ccgcttcttt attcttgggc 30

<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: plasmid 



<400> 7 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900
gagctcggat ccaagatgaa gcgcgcaaga ccgtctgaag ataccttcaa ccccgtgtat 960
ccatatgaca cggaaaccgg tcctccaact gtgccttttc ttactcctcc ctttgtatcc 1020
cccaatgggt ttcaagagag tccccctggg gtactctctt tgcgcctatc cgaacctcta 1080
gttacctcca atggcatgct tgcgctcaaa atgggcaacg gcctctctct ggacgaggcc 1140
ggcaacctta cctcccaaaa tgtaaccact gtgagcccac ctctcaaaaa aaccaagtca 1200
aacataaacc tggaaatatc tgcacccctc acagttacct cagaagccct aactgtggct 1260
gccgccgcac ctctaatggt cgcgggcaac acactcacca tgcaatcaca ggccccgcta 1320
accgtgcacg actccaaact tagcattgcc acccaaggac ccctcacagt gtcagaagga 1380
aagctagccc tgcaaacatc aggccccctc accaccaccg atagcagtac ccttactatc 1440
actgcctcac cccctctaac tactgccact ggtagcttgg gcattgactt gaaagagccc 1500
atttatacac aaaatggaaa actaggacta aagtacgggg ctcctttgca tgtaacagac 1560
gacctaaaca ctttgaccgt agcaactggt ccaggtgtga ctattaataa tacttccttg 1620
caaactaaag ttactggagc cttgggtttt gattcacaag gcaatatgca acttaatgta 1680
gcaggaggac taaggattga ttctcaaaac agacgcctta tacttgatgt tagttatccg 1740
tttgatgctc aaaaccaact aaatctaaga ctaggacagg gccctctttt tataaactca 1800
gcccacaact tggatattaa ctacaacaaa ggcctttact tgtttacagc ttcaaacaat 1860
tccaaaaagc ttgaggttaa cctaagcact gccaaggggt tgatgtttga cgctacagcc 1920
atagccatta atgcaggaga tgggcttgaa tttggttcac ctaatgcacc aaacacaaat 1980
cccctcaaaa caaaaattgg ccatggccta gaatttgatt caaacaaggc tatggttcct 2040
aaactaggaa ctggccttag ttttgacagc acaggtgcca ttacagtagg aaacaaaaat 2100
aatgataagc taactttgtg gaccacacca gctccatctc ctaactgtag actaaatgca 2160
gagaaagatg ctaaactcac tttggtctta acaaaatgtg gcagtcaaat acttgctaca 2220
gtttcagttt tggctgttaa aggcagtttg gctccaatat ctggaacagt tcaaagtgct 2280
catcttatta taagatttga cgaaaatgga gtgctactaa acaattcctt cctggaccca 2340
gaatattgga actttagaaa tggagatctt actgaaggca cagcctatac aaacgctgtt 2400
ggatttatgc ctaacctatc agcttatcca aaatctcacg gtaaaactgc caaaagtaac 2460
attgtcagtc aagtttactt aaacggagac aaaactaaac ctgtaacact aaccattaca 2520
ctaaacggta cacaggaaac aggagacaca actccaagtg catactctat gtcattttca 2580
tgggactggt ctggccacaa ctacattaat gaaatatttg ccacatcctc ttacactttt 2640
tcatacattg cccaagaata aagaagcggc cgctcgagca tgcatctaga gggccctatt 2700
ctatagtgtc acctaaatgc tagagctcgc tgatcagcct cgactgtgcc ttctagttgc 2760
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 2820
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 2880
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 2940
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct 3000
agggggtatc cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 3060
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct 3120
tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta 3180
gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt 3240
tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg 3300
ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat 3360
tcttttgatt tataagggat tttggggatt tcggcctatt ggttaaaaaa tgagctgatt 3420
taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt 3480
ccccaggctc cccaggcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc 3540
aggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 3600
tagtcagcaa ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3660
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 3720
gcctctgcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt 3780
tgcaaaaagc tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga 3840
ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 3900
gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3960
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4020
ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4080
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4140
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4200
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4260
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4320
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4380
cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4440
atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4500
cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4560
gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4620
tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4680
cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 4740
gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 4800
tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca 4860
atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt 4920
ccaaactcat caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg 4980
cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca 5040
acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca 5100
cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc 5160
attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt 5220
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 5280
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 5340
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata 5400
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 5460
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg 5520
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 5580
tttctcaatg ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 5640
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc 5700
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 5760
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 5820
gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 5880
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 5940
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 6000
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat 6060
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 6120
aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta 6180
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 6240
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac 6300
gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 6360
gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 6420
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 6480
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 6540
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg 6600
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc 6660
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 6720
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata 6780
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 6840
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca 6900
actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc 6960
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 7020
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 7080
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 7140
ctgacgtc 7148



<210> 8
<211> 7469 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: plasmid



<400> 8 
gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900
gagctcggat ctgaattcga gctcgctgtt gggctcgcgg ttgaggacaa actcttcgcg 960
gtctttccag tactcttgga tcggaaaccc gtcggcctcc gaacggtact ccgccaccga 1020
gggacctgag cgagtccgca tcgaccggat cggaaaacct ctcgagaaag gcgtctaacc 1080
agtcacagtc gcaaggtagg ctgagcaccg tggcgggcgg cagcgggtgg cggtcggggt 1140
tgtttctggc ggaggtgctg ctgatgatgt aattaaagta ggcggtcttg agacggcgga 1200
tggtcgaggt gaggtgtggc aggcttgaga tccaagatga agcgcgcaag accgtctgaa 1260
gataccttca accccgtgta tccatatgac acggaaaccg gtcctccaac tgtgcctttt 1320
cttactcctc cctttgtatc ccccaatggg tttcaagaga gtccccctgg ggtactctct 1380
ttgcgcctat ccgaacctct agttacctcc aatggcatgc ttgcgctcaa aatgggcaac 1440
ggcctctctc tggacgaggc cggcaacctt acctcccaaa atgtaaccac tgtgagccca 1500
cctctcaaaa aaaccaagtc aaacataaac ctggaaatat ctgcacccct cacagttacc 1560
tcagaagccc taactgtggc tgccgccgca cctctaatgg tcgcgggcaa cacactcacc 1620
atgcaatcac aggccccgct aaccgtgcac gactccaaac ttagcattgc cacccaagga 1680
cccctcacag tgtcagaagg aaagctagcc ctgcaaacat caggccccct caccaccacc 1740
gatagcagta cccttactat cactgcctca ccccctctaa ctactgccac tggtagcttg 1800
ggcattgact tgaaagagcc catttataca caaaatggaa aactaggact aaagtacggg 1860
gctcctttgc atgtaacaga cgacctaaac actttgaccg tagcaactgg tccaggtgtg 1920
actattaata atacttcctt gcaaactaaa gttactggag ccttgggttt tgattcacaa 1980
ggcaatatgc aacttaatgt agcaggagga ctaaggattg attctcaaaa cagacgcctt 2040
atacttgatg ttagttatcc gtttgatgct caaaaccaac taaatctaag actaggacag 2100
ggccctcttt ttataaactc agcccacaac ttggatatta actacaacaa aggcctttac 2160
ttgtttacag cttcaaacaa ttccaaaaag cttgaggtta acctaagcac tgccaagggg 2220
ttgatgtttg acgctacagc catagccatt aatgcaggag atgggcttga atttggttca 2280
cctaatgcac caaacacaaa tcccctcaaa acaaaaattg gccatggcct agaatttgat 2340
tcaaacaagg ctatggttcc taaactagga actggcctta gttttgacag cacaggtgcc 2400
attacagtag gaaacaaaaa taatgataag ctaactttgt ggaccacacc agctccatct 2460
cctaactgta gactaaatgc agagaaagat gctaaactca ctttggtctt aacaaaatgt 2520
ggcagtcaaa tacttgctac agtttcagtt ttggctgtta aaggcagttt ggctccaata 2580
tctggaacag ttcaaagtgc tcatcttatt ataagatttg acgaaaatgg agtgctacta 2640
aacaattcct tcctggaccc agaatattgg aactttagaa atggagatct tactgaaggc 2700
acagcctata caaacgctgt tggatttatg cctaacctat cagcttatcc aaaatctcac 2760
ggtaaaactg ccaaaagtaa cattgtcagt caagtttact taaacggaga caaaactaaa 2820
cctgtaacac taaccattac actaaacggt acacaggaaa caggagacac aactccaagt 2880
gcatactcta tgtcattttc atgggactgg tctggccaca actacattaa tgaaatattt 2940
gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaagcgg ccgctcgagc 3000
atgcatctag agggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc 3060
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3120
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 3180
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 3240
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg 3300
gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg cgcattaagc 3360
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3420
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 3480
ctaaatcggg gcatcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 3540
aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 3600
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 3660
ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttggggat ttcggcctat 3720
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attaattctg tggaatgtgt 3780
gtcagttagg gtgtggaaag tccccaggct ccccaggcag gcagaagtat gcaaagcatg 3840
catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt 3900
atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac tccgcccatc 3960
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4020
atttatgcag aggccgaggc cgcctctgcc tctgagctat tccagaagta gtgaggaggc 4080
ttttttggag gcctaggctt ttgcaaaaag ctcccgggag cttgtatatc cattttcgga 4140
tctgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg attgcacgca 4200
ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc 4260
ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc 4320
aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg 4380
ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg 4440
gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct 4500
gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct 4560
acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa 4620
gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa 4680
ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc 4740
gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt catcgactgt 4800
ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct 4860
gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc 4920
gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg 4980
ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc gattccaccg 5040
ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 5100
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 5160
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 5220
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tgtataccgt 5280
cgacctctag ctagagcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt 5340
atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg 5400
cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg 5460
gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 5520
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 5580
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 5640
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 5700
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 5760
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 5820
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 5880
tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt 5940
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 6000
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 6060
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 6120
tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 6180
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 6240
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6300
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 6360
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 6420
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6480
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 6540
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6600
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6660
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 6720
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 6780
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 6840
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 6900
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 6960
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7020
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 7080
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 7140
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 7200
tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7260

ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7320 

gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 7380 

tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7440 

catttccccg aaaagtgcca cctgacgtc 7469 

<210> 9
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 28 

<210> 10
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 10

ccgagctagc gactgaaaat gag 23 



<210> 11
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: primer



<400> 11 

cctctcgaga gacagcaaga cac 23 



<210> 12
<211> 11152
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: plasmid 
<400> 12 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 

aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 

aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 

ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240

gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 

cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 

cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 

atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 

atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 

ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 

aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 

ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720 







PCT/EP01/04863 




WO 01/83729 



aaagtgttat atagatccct cccttttcgt gaaagactcg ccagagctag acctccttgg 780 

tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 840 

ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900 

ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagccttta 960 

ttggcccaac cttgcggttc ccagggctta agtaagtttt tggttacaaa ctgttcttaa 1020 

aacgaggatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 1080 

gctctgagtg ttctattttc ctatgttctt ttggaattta tccaaatctt atgtaaatgc 1140 

ttatgtaaac caagatataa aagagtgctg attttttgag taaacttgca acagtcctaa 1200 

cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctccgctcgt cacttatcct 1260 

tcactttcca gagggtcccc ccgcagaccc cggcgaccct caggtcggcc gactgcggca 1320 

gctggcgccc gaacagggac cctcggataa gtgacccttg tctctatttc tactatttgg 1380 

tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440 

ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1500 

accgaagaaa tggccgccag tcttttggac cagctgatcg aagaggtact ggctgataat 1560 

cttccacctc ctagccattt tgaaccacct acccttcacg aactgtatga tttagacgtg 1620 

acggcccccg aagatcccaa cgaggaggcg gtttcgcaga tttttcccga ctctgtaatg 1680 

ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccgg ttctccggag 1740 

ccgcctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 

atgccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 

agtgacgacg aggatgaaga gggtgaggag tttgtgttag attatgtgga gcaccccggg 1920 

cacggttgca ggtcttgtca ttatcaccgg aggaatacgg gggacccaga tattatgtgt 1980 

tcgctttgct atatgaggac ctgtggcatg tttgtctaca gtaagtgaaa attatgggca 2040 

gtgggtgata gagtggtggg tttggtgtgg taattttttt tttaattttt acagttttgt 2100 

ggtttaaaga attttgtatt gtgatttttt taaaaggtcc tgtgtctgaa cctgagcctg 2160 

agcccgagcc agaaccggag cctgcaagac ctacccgccg tcctaaaatg gcgcctgcta 2220 

tcctgagacg cccgacatca cctgtgtcta gagaatgcaa tagtagtacg gatagctgtg 2280 

actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcccgctg tgccccatta 2340 

aaccagttgc cgtgagagtt ggtgggcgtc gccaggctgt ggaatgtatc gaggacttgc 2400 

ttaacgagcc tgggcaacct ttggacttga gctgtaaacg ccccaggcca taaggtgtaa 2460 

acctgtgatt gcgtgtgtgg ttaacgcctt tgtttgctga atgagttgat gtaagtttaa 2520 

taaagggtga gataatgttt aacttgcatg gcgtgttaaa tggggcgggg cttaaagggt 2580 

atataatgcg ccgtgggcta atcttggtta catctgacct catggaggct tgggagtgtt 2640 

tggaagattt ttctgctgtg cgtaacttgc tggaacagag ctctaacagt acctcttggt 2700 

tttggaggtt tctgtggggc tcatcccagg caaagttagt ctgcagaatt aaggaggatt 2760 

acaagtggga atttgaagag cttttgaaat cctgtggtga gctgtttgat tctttgaatc 2820 

tgggtcacca ggcgcttttc caagagaagg tcatcaagac tttggatttt tccacaccgg 2880 

ggcgcgctgc ggctgctgtt gcttttttga gttttataaa ggataaatgg agcgaagaaa 2940 

cccatctgag cggggggtac ctgctggatt ttctggccat gcatctgtgg agagcggttg 3000 

tgagacacaa gaatcgcctg ctactgttgt cttccgtccg cccggcgata ataccgacgg 3060 

aggagcagca gcagcagcag gaggaagcca ggcggcggcg gcaggagcag agcccatgga 3120 

acccgagagc cggcctggac cctcgggaat gaatgttgta caggtggctg aactgtatcc 3180 

agaactgaga cgcattttga caattacaga ggatgggcag gggctaaagg gggtaaagag 3240 

ggagcggggg gcttgtgagg ctacagagga ggctaggaat ctagctttta gcttaatgac 3300 

cagacaccgt cctgagtgta ttacttttca acagatcaag gataattgcg ctaatgagct 3360 

tgatctgctg gcgcagaagt attccataga gcagctgacc acttactggc tgcagccagg 3420 

ggatgatttt gaggaggcta ttagggtata tgcaaaggtg gcacttaggc cagattgcaa 3480 

gtacaagatc agcaaacttg taaatatcag gaattgttgc tacatttctg ggaacggggc 3540 

cgaggtggag atagatacgg aggatagggt ggcctttaga tgtagcatga taaatatgtg 3600 

gccgggggtg cttggcatgg acggggtggt tattatgaat gtaaggttta ctggccccaa 3660 

ttttagcggt acggttttcc tggccaatac caaccttatc ctacacggtg taagcttcta 3720 

tgggtttaac aatacctgtg tggaagcctg gaccgatgta agggttcggg gctgtgcctt 3780 

ttactgctgc tggaaggggg tggtgtgtcg ccccaaaagc agggcttcaa ttaagaaatg 3840 

cctctttgaa aggtgtacct tgggtatcct gtctgagggt aactccaggg tgcgccacaa 3900 

tgtggcctcc gactgtggtt gcttcatgct agtgaaaagc gtggctgtga ttaagcataa 3960 

catggtatgt ggcaactgcg aggacagggc ctctcagatg ctgacctgct cggacggcaa 4020 

ctgtcacctg ctgaagacca ttcacgtagc cagccactct cgcaaggcct ggccagtgtt 4080 

tgagcataac atactgaccc gctgttcctt gcatttgggt aacaggaggg gggtgttcct 4140 

accttaccaa tgcaatttga gtcacactaa gatattgctt gagcccgaga gcatgtccaa 4200 

ggtgaacctg aacggggtgt ttgacatgac catgaagatc tggaaggtgc tgaggtacga 4260 

tgagacccgc accaggtgca gaccctgcga gtgtggcggt aaacatatta ggaaccagcc 4320 

tgtgatgctg gatgtgaccg aggagctgag gcccgatcac ttggtgctgg cctgcacccg 4380 

cgctgagttt ggctctagcg atgaagatac agattgaggt actgaaatgt gtgggcgtgg 4440 
cttaagggtg

agcagccgcc 

gacaacgcgc 

tggtcgcccc 

gccgttggag 

tgtgactgac 

ccgcgatgac 

tgtcgtttct 

ccctcccaat 

gcaagtgtct 

cataattgga 

gtgtataatg 

gaactgatga 

aagaaatgcc 

aaaagaagag 

gtcatgctgt 

aagctgcact 

ataacagtta 

ctattaataa 

ataaggaata 

gtagaggttt 

atgaatgcaa 

aatagcatca 

tccaaactca 

gggtgtggaa 

tagtcagcaa 

atgcatctca 

actccgccca 

gaggccgagg 

ggcctaggct 

accactttat 

actggttttt 

tttttaagcc 

ctgggacatg 

atggaaaggc 

tgaactgggt 

gcgcgagctt 

tgacctggtg 

ctttgtcacc 

tatcccgcaa 

aatctccggt 

caggcgggtt 

acctgagcga 

gccgctttaa 

ctcactggta 

aaatcctcga 

tatacgatac 

tttgtgaagg 

taaagctcta 

attgtttgtg 

gcctttaatg 

actgctgact 

gactttcctt 

gcttgctttg 

gaaaaatatt 

tttcttactc 

acctttagct 

actagagatc 

cccacacctc 

tattgcagct 

atttttttca 

ctggatcccc



ggaaagaata 

gccgccatga 

atgcccccat 

gtcctgcccg 

actgcagcct 

tttgctttcc 

aagttgacgg 

cagcagctgt 

gcggtttaaa 

tgctgtctct 

caaactacct 

tgttaaacta 

atgggagcag 

atctagtgat 

aaaggtagaa 

gtttagtaat 

gctatacaag 

taatcataac 

ctatgctcaa 

tttgatgtat 

tacttgcttt 

ttgttgttgt 

caaatttcac 

tcaatgtatc 

agtccccagg 

ccaggtgtgg 

attagtcagc 

gttccgccca 

ccgcctcggc 

tttgcaaaaa 

cccgcgtcag 

agtgcgccag 

gtagataaac 

ttgcagatcc 

attattgccg

attcgtcatg 

aaagtgctga 

gataccggtg 

atcttcgcaa 

gatacctgga 

cgctaatctt 

acaatagttt 

aaccctgttc 

tcacggcgca 

tcgcatgatt 

cgtccaggca 

ggtgattggc 

aaccttactt 

aggtaaatat 

tattttagat 

aggaaaacct 

ctcaacattc 

cagaattgct 

ctatttacac 

ctgtaacctt 

cacacaggca 

ttttaatttg 

ataatcagcc 

cccctgaacc 

tataatggtt

ctgcattcta 

aggaagctcc 



tataaggtgg 

gcaccaactc 

gggccggggt 

caaactctac 

ccgccgccgc 

tgagcccgct 

ctcttttggc 

tggatctgcg 

acataaataa 

cgagggatct

acagagattt 

ctgattctaa 

tggtggaatg 

gatgaggcta 

gaccccaagg 

agaactcttg 

aaaattatgg 

atactgtttt 

aaattgtgta 

agtgccttga 

aaaaaacctc 

taacttgttt 

aaataaagca 

ttatcatgtc 

ctccccagca 

aaagtcccca 

aaccatagtc 

ttctccgccc 

ctctgagcta 

gcttggacac 

ggagaggcag 

atctctataa 

aggctgggac 

atgcacgtaa 

taagccgtgg 

tcgataccgt 

aacgcgcaga 

gtactgcggt 

aaccggctgg 

ttgaacagcc 

ttcaacgcct 

ccagtaagta 

aaaccccgct 

caaccgcctg 

aaccgtctga 

cgtattgtga 

taccgtggcg 

ctgtggtgtg 

aaaattttta 

tccaacctat 

gttttgctca 

tactcctcca 

aagttttttg 

cacaaaggaa 

tataagtagg 

tagagtgtct 

taaaggggtt 

ataccacatt 

tgaaacataa 

acaaataaag 

gttgtggttt 

tctgtgtcct 



gggtcttatg

gtttgatgga 

gcgtcagaat 

taccttgacc 

ttcagccgct 

tgcaagcagt 

acaattggat 

ccagcaggtt 

aaaaccagac 

ttgtgaagga 

aaagctctaa 

ttgtttgtgt 

cctttaatga 

ctgctgactc 

actttccttc 

cttgctttgc 

aaaaatattc 

ttcttactcc 

cctttagctt 

ctagagatca 

ccacacctcc 

attgcagctt 

tttttttcac 

tggatccggc 

ggcagaagta 

ggctccccag 

ccgcccctaa 

catggctgac 

ttccagaagt 

aagacaggct 

tgcgtaaaaa 

tctcgcgcaa 

acttcacatg 

actcgcaagc 

cggtctggta 

ttgtatttcc 

aggcgatggc 

tgcgattcgt 

tcgtccgctg 

gtgggatatg 

ggcactgccg 

ttctggaggc 

ttaaacatcc 

tgcagtcggc 

tgtggatctg 

tgagcgatgc 

gcaactggat 

acataattgg 

agtgtataat

ggaactgatg 

gaagaaatgc 

aaaaagaaga 

agtcatgctg 

aaagctgcac 

cataacagtt 

gctattaata 

aataaggaat 

tgtagaggtt 

aatgaatgca 

caatagcatc 

gtccaaactc 

cataaaccct 



tagttttgta 

agcattgtga 

gtgatgggct 

tacgagaccg 

gcagccaccg 

gcagcttccc 

tctttgaccc 

tctgccctga 

tctgtttgga 

accttacttc 

ggtaaatata 

attttagatt 

ggaaaacctg 

tcaacattct 

agaattgcta 

tatttacacc 

tgtaaccttt 

acacaggcat 

tttaatttgt 

taatcagcca 

ccctgaacct 

ataatggtta 

tgcattctag 

tgtggaatgt 

tgcaaagcat 

caggcagaag 

ctccgcccat 

taattttttt 

agtgaggagg 

tgcgagatat 

gacgcggact 

cctattttcc 

agcgaaaaat 

cgactgatgc 

ccgggtgcgt 

agctacgatc 

gaaggcttca 

gaaatgtatc 

gttgatgact 

ggcgtcgtat 

ggcgttgttc 

tgcatccatg 

tgaaacctcg 

ccttgatggt 

gcgcggcatt 

cgaacgtacc 

ttatgagtgg 

acaaactacc 

gtgttaaact 

aatgggagca 

catctagtga 

gaaaggtaga 

tgtttagtaa 

tgctatacaa 

ataatcataa 

actatgctca 

atttgatgta 

ttacttgctt 

attgttgttg 

acaaatttca 

atcaatgtat 

aacctcctct 



tctgttttgc 

gctcatattt 

ccagcattga 

tgtctggaac 

cccgcgggat 

gttcatccgc 

gggaacttaa 

aggcttcctc 

tttggatcaa 

tgtggtgtga 

aaatttttaa 

ccaacctatg 

ttttgctcag 

actcctccaa 

agttttttga 

acaaaggaaa 

ataagtaggc 

agagtgtctg 

aaaggggtta 

taccacattt 

gaaacataaa 

caaataaagc 

ttgtggtttg 

gtgtcagtta 

gcatctcaat 

tatgcaaagc 

cccgccccta 

tatttatgca 

cttttttgga 

gtttgagaat 

catgtgaaat 

cctcgaacac 

acatcgtcac 

cttctgaaca 

tactggcgcg 

acgacaacca 

tcgttattga 

caaaagcgca 

atgttgttga 

tcgtcccgcc 

tttttaactt 

acacaggcaa 

acgctagtcc 

aaaaccatcc 

gacccacgcg 

gacgatgatt 

gccccggatc 

tacagagatt 

actgattcta 

gtggtggaat 

tgatgaggct 

agaccccaag 

tagaactctt 

gaaaattatg 

catactgttt 

aaaattgtgt 

tagtgccttg 

taaaaaacct 

ttaacttgtt 

caaataaagc 

cttatcatgt 

acttgagagg 



4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 
acattccaat

acaaaaagga 

atctgggaag 

caacagcaga 

gaagcactgt 

tgtaggttcc 

ctggataagc 

caactgtagc 

acacaccctg 

gaatgggttt 

tagcagttac 

cacaggttaa 

gatacgccta 

cacttttcgg 

tatgtatccg 

gagtatgagt 

tcctgttttt 

tgcacgagtg 

ccccgaagaa 

atcccgtgtt 

cttggttgag 

attatgcagt 

gatcggagga 

ccttgatcgt 

gatgcctgca 

agcttcccgg 

gcgctcggcc 

gtctcgcggt 

ctacacgacg 

tgcctcactg 

tgatttaaaa 

catgaccaaa 

gatcaaagga 

aaaaccaccg 

gaaggtaact 

gttaggccac 

gttaccagtg 

atagttaccg 

cttggagcga 

cacgcttccc 

agagcgcacg 

tcgccacctc 

gaaaaacgcc 

catgttcttt 

agctgatacc 

ggaagagcgc 

atggtgcact

ctatcgctac 

ccctgacggg 

agctgcatgt 



cataggctgc 
aattgggtag 
tcccttccac 
aacatacaag 
ggttgctgtg 
aaaatatcta 
attatcctta 
attttttggg 
cagctccaaa 
tccagcacca 
cccaataacc 
gtcctcattt 
tttttatagg 
ggaaatgtgc 
ctcatgagac 
attcaacatt 
gctcacccag 
ggttacatcg 
cgttttccaa 
gacgccgggc 
tactcaccag 
gctgccataa 
ccgaaggagc 
tgggaaccgg
gcaatggcaa 
caacaattaa 
cttccggctg 
atcattgcag 
gggagtcagg 
attaagcatt 
cttcattttt 
atcccttaac 
tcttcttgag 
ctaccagcgg 
ggcttcagca 
cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 
agggagcttc 
tgacttgagc 
agcaacgcgg 
cctgcgttat 
gctcgccgca 
ctgatgcggt 
ctcagtacaa 
gtgactgggt 
cttgtctgct 
gtcagaggtt 



ccatccaccc 

gggtttttca 

tgctgtgttc 

ctgtcagctt 

ttagtaatgt 

gtgttttcat 

tccaaaacag 

gttacagttt 

ggttccccac 

ttttcatgag 

tcagttttaa 

aaattaggca 

ttaatgtcat 

gcggaacccc 

aataaccctg 

tccgtgtcgc 

aaacgctggt 

aactggatct 

tgatgagcac 

aagagcaact 

tcacagaaaa 

ccatgagtga 

taaccgcttt 

agctgaatga 

caacgttgcg 

tagactggat 

gctggtttat 

cactggggcc 

caactatgga 

ggtaactgtc 

aatttaaaag 

gtgagttttc 

atcctttttt 

tggtttgttt 

gagcgcagat 

actctgtagc 

gtggcgataa 

agcggtcggg 

ccgaactgag 

aggcggacag 

cagggggaaa 

gtcgattttt 

cctttttacg 

cccctgattc 

gccgaacgac 

attttctcct 

tctgctctga 

catggctgcg 

cccggcatcc 

ttcaccgtca 



tctgtgtcct 

cagaccgctt 

cagaagtgtt 

tgcacaaggg 

gcaaaacagg 

ttttacttgg 

ccttgtggtc 

gagcaggata 

caacagcaaa 

ttttttgtgt 

cagtaacagc 

aaggaattct 

gataataatg 

tatttgttta 

ataaatgctt 

ccttattccc 

gaaagtaaaa 

caacagcggt 

ttttaaagtt 

cggtcgccgc 

gcatcttacg 

taacactgcg 

tttgcacaac 

agccatacca 

caaactatta 

ggaggcggat 

tgctgataaa 

agatggtaag

tgaacgaaat 

agaccaagtt 

gatctaggtg 

gttccactga 

tctgcgcgta 

gccggatcaa 

accaaatact 

accgcctaca 

gtcgtgtctt 

ctgaacgggg 

atacctacag 

gtatccggta 

cgcctggtat 

gtgatgctcg 

gttcctggcc 

tgtggataac 

cgagcgcagc 

tacgcatctg 

tgccgcatag 

ccccgacacc 

gcttacagac 

tcaccgaaac 



<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 13 

gacggatcgg gagatctcc 



cctgttaatt 

tctaagggta 

ggtaaacagc 

cccaacaccc 

aggcacattt 

atcaggaacc 

agtgttcatc 

tttggtcctg 

aaaatgaaaa 

ccctgaatgc 

ttcccacatc 

tgaagacgaa 

gtttcttaga 

tttttctaaa 

caataatatt 

ttttttgcgg 

gatgctgaag 

aagatccttg 

ctgctatgtg 

atacactatt 

gatggcatga 

gccaacttac 

atgggggatc

aacgacgagc 

actggcgaac 

aaagttgcag 

tctggagccg 

ccctcccgta 

agacagatcg 

tactcatata 

aagatccttt 

gcgtcagacc 

atctgctgct 

gagctaccaa 

gtccttctag 

tacctcgctc 

accgggttgg 

ggttcgtgca 

cgtgagctat 

agcggcaggg 

ctttatagtc 

tcaggggggc 

ttttgctggc 

cgtattaccg 

gagtcagtga 

tgcggtattt 

ttaagccagt 

cgccaacacc 

aagctgtgac 

gcgcgaggca 



aggtcactta 

attttaaaat 

ccacaaatgt 

tgctcatcaa 

tccccacctg 

cagcactcca 

tgctgactgt 

tagtttgcta 

tttgaccctt 

aagtttaaca 

aaaatatttc 

agggcctcgt 

cgtcaggtgg 

tacattcaaa 

gaaaaaggaa 

cattttgcct 

atcagttggg 

agagttttcg 

gcgcggtatt 

ctcagaatga 

cagtaagaga 

ttctgacaac 

atgtaactcg 

gtgacaccac 

tacttactct 

gaccacttct 

gtgagcgtgg 

tcgtagttat 

ctgagatagg

tactttagat 

ttgataatct 

ccgtagaaaa 

tgcaaacaaa 

ctctttttcc 

tgtagccgta 

tgctaatcct 

actcaagacg 

cacagcccag 

gagaaagcgc 

tcggaacagg 

ctgtcgggtt 

ggagcctatg 

cttttgctca 

cctttgagtg 

gcgaggaagc 

cacaccgcat 

atacactccg 

cgctgacgcg 

cgtctccggg 

gc 



8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11152 
<210> 14
<211> 22
<212> DNA 

<213> Artificial Sequence

<223> Description of Artificial Sequence: primer 

<400> 14

ccgcctcaga agccatagag cc 22



<210> 15 
<211> 14455 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: plasmid 



<400> 15 
aagcttgggc
aaggggttgt 
aaagacatat 
ggggaagttg 
gtgcaagatt 
cagccaactt 
cttgctaaaa 
atgttaagaa 
atgggaatag 
ttgaccacag 
aggtggtggc 
ttaccatata 
aaagtgttat 
tgtatgttgt 
ctaggaacag 
ggactaatag 
ttggcccaac 
aacgaggatg 
gctctgagtg 
ttatgtaaac 
cattcacctc 
tcactttcca 
gctggcgccc 
tgtttgtctt 
ccatagggac 
accgaagaaa 
cttccacctc 
acggcccccg 
ttggcggtgc 
ccgcctcacc 
atgccaaacc 
agtgacgacg 
cacggttgca 
tcgctttgct 
gtgggtgata 
ggtttaaaga 
agcccgagcc 
tcctgagacg 
actccggtcc 
aaccagttgc 
ttaacgagcc 



agaaatggtt 
ttcccaccaa 
tcattctctg 
cggttcgtgc 
acaatctaaa 
cctcttacaa 
attatatttt 
atgaatcatt 
aaaatagaaa 
gcctagaagt 
aaccagggac 
caggaagata 
atagatccct 
ctcaagaaga 
gaatgcactt 
aacattattc 
cttgcggttc 
tgagacaagt 
ttctattttc 
caagatataa 
ttgtgtgttt 
gagggtcccc 
gaacagggac 
gtattgtctc 
caagctagcg 
tggccgccag 
ctagccattt 
aagatcccaa 
aggaagggat 
tttcccggca 
ttgtaccgga 
aggatgaaga 
ggtcttgtca 
atatgaggac 
gagtggtggg 
attttgtatt 
agaaccggag 
cccgacatca 
ttctaacaca 
cgtgagagtt 
tgggcaacct 



gaactcccga 
ggacgacccg 
ctgcaaactt 
tcgcagggct 
caattcggag 
gccgcatcga 
taccaataag 
atcttttagt 
gagacgctca 
aaaaaaggga 
ttatagggga 
tgacttaaat 
cccttttcgt 
aaaagacgac 
ttggggaaag 
tgcaaaaact 
ccagggctta 
ggtttcctga 
ctatgttctt 
aagagtgctg 
gtgtctgttc 
ccgcagaccc 
cctcggataa 
tttcttgtct 
actgaaaatg 
tcttttggac 
tgaaccacct 
cgaggaggcg 
tgacttactc 
gcccgagcag 
ggtgatcgat 
gggtgaggag 
ttatcaccgg 
ctgtggcatg 

tttggtgtgg 

gtgatttttt 

cctgcaagac 
cctgtgtcta 
cctcctgaga 
ggtgggcgtc 
ttggacttga 



gagtgtccta 
tctgcgcaca 
ggcatagctc 
ctcacccttg 
aactcgacct 
ttttgtcctt 
accaatccaa 
actattttta 
acctcaattg 
aaaaagagtg 
ccttacatct 
tgggataggt 
gaaagactcg 
atgaaacaac 
attttccata 
catggcatga 
agtaagtttt 

cttggtttgg 
ttggaattta 
attttttgag 
gccatcccgt 
cggcgaccct 
gtgacccttg 
ggctatcatc 
agacatatta 
cagctgatcg 
acccttcacg 
gtttcgcaga 
acttttccgc 
ccggagcaga 
cttacctgcc 
tttgtgttag 
aggaatacgg 
tttgtctaca 
taattttttt 
taaaaggtcc 
ctacccgccg 
gagaatgcaa 
tacacccggt 
gccaggctgt 
gctgtaaacg 



cacctagggg 
aacggatgag 
tgctttgcct 
actcttttaa 
tcctcctgag 
cagaaataga 
taggtagatt 
ctcaaattca 
aagaacaggt 
tttttgtcaa 
acagaccaac 
gggttacagt 
ccagagctag 
aggtacatga 

ccaaggaggg 
gttattatga 
tggttacaaa 
tatcaaaggt 
tccaaatctt 
taaacttgca 
ctccgctcgt 
caggtcggcc 
tctctatttc 
acaagagcgg 
tctgccacgg 
aagaggtact 
aactgtatga 
tttttcccga 
cggcgcccgg 
gagccttggg 
acgaggctgg 
attatgtgga 
gggacccaga 
gtaagtgaaa 
tttaattttt 
tgtgtctgaa 
tcctaaaatg 
tagtagtacg 
ggtcccgctg 
ggaatgtatc 
ccccaggcca 



840 

900 

960 



agaagcagcc 60 
cccatcagac 120 
ggggctattg 180 
tagctcttct 240 
gcaaggacca 300 
aataagaatg 360 
attagttact 420 
gaagttagaa 480 
gcaaggacta 540 
aataggagac 600 
agatgccccc 660 
caatggctat 720 
acctccttgg 780 
ttatatttat 
gacagtggct 
atagccttta 
ctgttcttaa 1020 
tctgatctga 1080 
atgtaaatgc 1140 
acagtcctaa 1200 
cacttatcct 1260 
gactgcggca 1320 
tactatttgg 1380 
aacggactca 1440 
aggtgttatt 1500 
ggctgataat 1560 
tttagacgtg 1620 
ctctgtaatg 1680 
ttctccggag 1740 
tccggtttct 1800 
ctttccaccc 1860 
gcaccccggg 1920 
tattatgtgt 1980 
attatgggca 2040 
acagttttgt 2100 
cctgagcctg 2160 
gcgcctgcta 2220 
gatagctgtg 2280 
tgccccatta 2340 
gaggacttgc 2400 
taaggtgtaa 2460 
acctgtgatt
taaagggtga 
atataatgcg 
tggaagattt 
tttggaggtt 
acaagtggga 
tgggtcacca 
ggcgcgctgc 
cccatctgag 
tgagacacaa 
aggagcagca 
acccgagagc 
agaactgaga 

ggagcggggg 

cagacaccgt 
tgatctgctg 

ggatgatttt 
gtacaagatc 
cgaggtggag 
gccgggggtg 
ttttagcggt 
tgggtttaac 
ttactgctgc 
cctctttgaa 
tgtggcctcc 
catggtatgt 
ctgtcacctg 
tgagcataac 
accttaccaa 
ggtgaacctg 
tgagacccgc 
tgtgatgctg 
cgctgagttt 
cttaagggtg 
agcagccgcc 
gacaacgcgc 
tggtcgcccc 
gccgttggag 
tgtgactgac 
ccgcgatgac 
tgtcgtttct 
ccctcccaat 
gcaagtgtct 
cataattgga 
gtgtataatg 
gaactgatga 
aagaaatgcc 
aaaagaagag 
gtcatgctgt 
aagctgcact 
ataacagtta 
ctattaataa 
ataaggaata 
gtagaggttt 
atgaatgcaa 
aatagcatca 
tccaaactca 
gggtgtggaa 
tagtcagcaa 
atgcatctca 
actccgccca 
gaggccgagg



gcgtgtgtgg 
gataatgttt 
ccgtgggcta 
ttctgctgtg 
tctgtggggc 
atttgaagag 
ggcgcttttc 
ggctgctgtt 
cggggggtac 
gaatcgcctg 
gcagcagcag 
cggcctggac 
cgcattttga 
gcttgtgagg 
cctgagtgta 
gcgcagaagt 
gaggaggcta 
agcaaacttg 
atagatacgg 
cttggcatgg 
acggttttcc 
aatacctgtg 
tggaaggggg 
aggtgtacct 
gactgtggtt 
ggcaactgcg 
ctgaagacca 
atactgaccc 
tgcaatttga 
aacggggtgt 
accaggtgca 
gatgtgaccg 
ggctctagcg 
ggaaagaata 
gccgccatga 
atgcccccat 
gtcctgcccg 
actgcagcct 
tttgctttcc 
aagttgacgg 
cagcagctgt 
gcggtttaaa 
tgctgtctct 
caaactacct 
tgttaaacta 
atgggagcag 
atctagtgat 
aaaggtagaa 
gtttagtaat 
gctatacaag 
taatcataac 
ctatgctcaa 
tttgatgtat 
tacttgcttt 
ttgttgttgt 
caaatttcac 
tcaatgtatc 
agtccccagg 
ccaggtgtgg 
attagtcagc 
gttccgccca 
ccgcctcggc 



ttaacgcctt 
aacttgcatg 
atcttggtta 
cgtaacttgc 
tcatcccagg 
cttttgaaat 
caagagaagg 
gcttttttga 
ctgctggatt 
ctactgttgt 
gaggaagcca 
cctcgggaat 
caattacaga 
ctacagagga 
ttacttttca 
attccataga 
ttagggtata 
taaatatcag 
aggatagggt 
acggggtggt 
tggccaatac 
tggaagcctg 
tggtgtgtcg 
tgggtatcct 
gcttcatgct 
aggacagggc 
ttcacgtagc 
gctgttcctt 
gtcacactaa 
ttgacatgac 
gaccctgcga 
aggagctgag 
atgaagatac 
tataaggtgg 
gcaccaactc 
gggccggggt 
caaactctac 
ccgccgccgc 
tgagcccgct 
ctcttttggc 
tggatctgcg 
acataaataa 
cgagggatct 
acagagattt 
ctgattctaa 
tggtggaatg 
gatgaggcta 
gaccccaagg 
agaactcttg 
aaaattatgg 
atactgtttt 
aaattgtgta 
agtgccttga 
aaaaaacctc 
taacttgttt 
aaataaagca 
ttatcatgtc 
ctccccagca 
aaagtcccca 
aaccatagtc 
ttctccgccc 
ctctgagcta 



tgtttgctga 
gcgtgttaaa 
catctgacct 
tggaacagag 
caaagttagt 
cctgtggtga 
tcatcaagac 
gttttataaa 
ttctggccat 
cttccgtccg 
ggcggcggcg 
gaatgttgta 
ggatgggcag 
ggctaggaat 
acagatcaag 
gcagctgacc 
tgcaaaggtg 
gaattgttgc 
ggcctttaga 
tattatgaat 
caaccttatc 
gaccgatgta 
ccccaaaagc 
gtctgagggt 
agtgaaaagc 
ctctcagatg 
cagccactct 
gcatttgggt 
gatattgctt 
catgaagatc 
gtgtggcggt 
gcccgatcac 
agattgaggt 
gggtcttatg

gtttgatgga 
gcgtcagaat 
taccttgacc 
ttcagccgct 
tgcaagcagt 
acaattggat 
ccagcaggtt 
aaaaccagac 
ttgtgaagga 
aaagctctaa 

ttgtttgtgt 
cctttaatga 
ctgctgactc 
actttccttc 
cttgctttgc 
aaaaatattc 

ttcttactcc 
cctttagctt 
ctagagatca 
ccacacctcc 
attgcagctt 
tttttttcac 
tggatccggc 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 



atgagttgat 

tggggcgggg

catggaggct 
ctctaacagt 
ctgcagaatt 
gctgtttgat 
tttggatttt 
ggataaatgg 
gcatctgtgg 
cccggcgata 
gcaggagcag 
caggtggctg 
gggctaaagg 
ctagctttta 
gataattgcg 
acttactggc 
gcacttaggc 
tacatttctg 
tgtagcatga 
gtaaggttta 
ctacacggtg 
agggttcggg 
agggcttcaa 
aactccaggg 
gtggctgtga 
ctgacctgct 
cgcaaggcct 
aacaggaggg 
gagcccgaga 
tggaaggtgc 
aaacatatta 
ttggtgctgg 
actgaaatgt 
tagttttgta 
agcattgtga 
gtgatgggct 
tacgagaccg 
gcagccaccg 
gcagcttccc 
tctttgaccc 
tctgccctga 
tctgtttgga 
accttacttc 
ggtaaatata 
attttagatt 
ggaaaacctg 
tcaacattct 
agaattgcta 

tatttacacc 
tgtaaccttt 
acacaggcat 
tttaatttgt 
taatcagcca 
ccctgaacct 
ataatggtta 
tgcattctag 
tgtggaatgt 
tgcaaagcat 
caggcagaag 
ctccgcccat 

taattttttt 

agtgaggagg 



gtaagtttaa 2520 
cttaaagggt 2580 
tgggagtgtt 2640 
acctcttggt 2700 
aaggaggatt 2760 
tctttgaatc 2820 
tccacaccgg 2880 
agcgaagaaa 2940 
agagcggttg 3000 
ataccgacgg 3060 
agcccatgga 3120 
aactgtatcc 3180 
gggtaaagag 3240 
gcttaatgac 3300 
ctaatgagct 3360 
tgcagccagg 3420 
cagattgcaa 3480 
ggaacggggc 3540 
taaatatgtg 3600 
ctggccccaa 3660 
taagcttcta 3720 
gctgtgcctt 3780 
ttaagaaatg 3840 
tgcgccacaa 3900 
ttaagcataa 3960 
cggacggcaa 4020 
ggccagtgtt 4080 
gggtgttcct 4140
gcatgtccaa 4200 
tgaggtacga 4260 
ggaaccagcc 4320 
cctgcacccg 4380 
gtgggcgtgg 4440 
tctgttttgc 4500 
gctcatattt 4560 
ccagcattga 4620 
tgtctggaac 4680 
cccgcgggat 4740 
gttcatccgc 4800 
gggaacttaa 4860 
aggcttcctc 4920 
tttggatcaa 4980 
tgtggtgtga 5040 
aaatttttaa 5100 
ccaacctatg 5160 
ttttgctcag 5220 
actcctccaa 5280 
agttttttga 5340 
acaaaggaaa 5400 
ataagtaggc 5460 
agagtgtctg 5520 
aaaggggtta 5580 
taccacattt 5640 
gaaacataaa 5700 
caaataaagc 5760 
ttgtggtttg 5820 
gtgtcagtta 5880 
gcatctcaat 5940 
tatgcaaagc 6000 
cccgccccta 6060 
tatttatgca 6120 
cttttttgga 6180 



ggcctaggct
accactttat 
actggttttt 

tttttaagcc 
ctgggacatg 
atggaaaggc 
tgaactgggt 
gcgcgagctt 
tgacctggtg 
ctttgtcacc 
tatcccgcaa 
aatctccggt 
caggcgggtt 
acctgagcga 
gccgctttaa 
ctcactggta 
aaatcctcga 
tatacgatac 
tttgtgaagg 
taaagctcta 
attgtttgtg 
gcctttaatg 
actgctgact 
gactttcctt 
gcttgctttg 
gaaaaatatt 
tttcttactc 
acctttagct 
actagagatc 
cccacacctc 
tattgcagct 
atttttttca 
ctggatcccc 
acattccaat 
acaaaaagga 
atctgggaag 
caacagcaga 
gaagcactgt 
tgtaggttcc 
ctggataagc 
caactgtagc 
acacaccctg 
gaatgggttt 
tagcagttac 
cacaggttaa 
gatacgccta 
cacttttcgg 
tatgtatccg 
gagtatgagt 
tcctgttttt 
tgcacgagtg 
ccccgaagaa 
atcccgtgtt 
cttggttgag 
attatgcagt 
gatcggagga 
ccttgatcgt 
gatgcctgca 
agcttcccgg 
gcgctcggcc 
gtctcgcggt 
ctacacgacg 



tttgcaaaaa 
cccgcgtcag 
agtgcgccag 
gtagataaac 
ttgcagatcc 
attattgccg 
attcgtcatg 
aaagtgctga 
gataccggtg 
atcttcgcaa 
gatacctgga 
cgctaatctt 
acaatagttt 
aaccctgttc 
tcacggcgca 
tcgcatgatt 
cgtccaggca 
ggtgattggc 
aaccttactt 
aggtaaatat 
tattttagat 
aggaaaacct 
ctcaacattc 
cagaattgct 
ctatttacac 
ctgtaacctt 
cacacaggca 
ttttaatttg 
ataatcagcc 
cccctgaacc 
tataatggtt 
ctgcattcta 
aggaagctcc 
cataggctgc 
aattgggtag 
tcccttccac 
aacatacaag 
ggttgctgtg 
aaaatatcta 
attatcctta 
attttttggg 
cagctccaaa 
tccagcacca 
cccaataacc 
gtcctcattt 

tttttatagg 

ggaaatgtgc 
ctcatgagac 
attcaacatt 
gctcacccag 
ggttacatcg 
cgttttccaa 
gacgccgggc 
tactcaccag 
gctgccataa 
ccgaaggagc 
tgggaaccgg 
gcaatggcaa 
caacaattaa 
cttccggctg 
atcattgcag 
gggagtcagg 



gcttggacac 
ggagaggcag 
atctctataa 
aggctgggac 
atgcacgtaa 
taagccgtgg 
tcgataccgt 
aacgcgcaga 
gtactgcggt 
aaccggctgg 
ttgaacagcc 
ttcaacgcct 
ccagtaagta 
aaaccccgct 
caaccgcctg 
aaccgtctga 
cgtattgtga 
taccgtggcg 
ctgtggtgtg 
aaaattttta
tccaacctat 
gttttgctca 
tactcctcca 
aagttttttg 
cacaaaggaa 
tataagtagg 
tagagtgtct 
taaaggggtt 
ataccacatt 
tgaaacataa 
acaaataaag 
gttgtggttt 
tctgtgtcct 
ccatccaccc 
gggtttttca 
tgctgtgttc 
ctgtcagctt 
ttagtaatgt 
gtgttttcat 
tccaaaacag 
gttacagttt 
ggttccccac 
ttttcatgag 
tcagttttaa 
aaattaggca 
ttaatgtcat 
gcggaacccc 
aataaccctg 
tccgtgtcgc 
aaacgctggt 
aactggatct 
tgatgagcac 
aagagcaact 
tcacagaaaa 
ccatgagtga 
taaccgcttt 
agctgaatga 
caacgttgcg 
tagactggat 
gctggtttat 
cactggggcc 
caactatgga 



aagacaggct 
tgcgtaaaaa 
tctcgcgcaa 
acttcacatg 
actcgcaagc 
cggtctggta 
ttgtatttcc 
aggcgatggc 
tgcgattcgt 
tcgtccgctg 
gtgggatatg 
ggcactgccg 
ttctggaggc 
ttaaacatcc 
tgcagtcggc 
tgtggatctg 
tgagcgatgc 
gcaactggat 
acataattgg 
agtgtataat 
ggaactgatg 
gaagaaatgc 
aaaaagaaga 
agtcatgctg 
aaagctgcac 
cataacagtt 
gctattaata 
aataaggaat 
tgtagaggtt 
aatgaatgca 
caatagcatc 
gtccaaactc 
cataaaccct 
tctgtgtcct 
cagaccgctt 
cagaagtgtt 

tgcacaaggg 
gcaaaacagg 
ttttacttgg 
ccttgtggtc 
gagcaggata 
caacagcaaa 
ttttttgtgt 
cagtaacagc 
aaggaattct 
gataataatg 
tatttgttta 
ataaatgctt 
ccttattccc 
gaaagtaaaa 
caacagcggt 
ttttaaagtt 
cggtcgccgc 
gcatcttacg 
taacactgcg 
tttgcacaac 
agccatacca 
caaactatta 
ggaggcggat 
tgctgataaa 
agatggtaag 
tgaacgaaat 



tgcgagatat 
gacgcggact 
cctattttcc 
agcgaaaaat 
cgactgatgc 
ccgggtgcgt 
agctacgatc 
gaaggcttca 
gaaatgtatc 
gttgatgact 

ggcgtcgtat
ggcgttgttc 
tgcatccatg 
tgaaacctcg 
ccttgatggt 
gcgcggcatt 
cgaacgtacc 
ttatgagtgg 
acaaactacc 
gtgttaaact 
aatgggagca 
catctagtga 
gaaaggtaga 
tgtttagtaa 
tgctatacaa 
ataatcataa 
actatgctca 
atttgatgta 
ttacttgctt 
attgttgttg 
acaaatttca 
atcaatgtat 
aacctcctct 
cctgttaatt 
tctaagggta 
ggtaaacagc 
cccaacaccc 
aggcacattt 
atcaggaacc 
agtgttcatc 

tttggtcctg 
aaaatgaaaa 
ccctgaatgc 
ttcccacatc 
tgaagacgaa 
gtttcttaga 
tttttctaaa 

caataatatt 

ttttttgcgg 
gatgctgaag 
aagatccttg 
ctgctatgtg 
atacactatt 
gatggcatga 
gccaacttac 
atgggggatc 
aacgacgagc 
actggcgaac 
aaagttgcag 
tctggagccg 
ccctcccgta 
agacagatcg 



gtttgagaat 6240 
catgtgaaat 6300 
cctcgaacac 6360 
acatcgtcac 6420 
cttctgaaca 6480 
tactggcgcg 6540 
acgacaacca 6600 
tcgttattga 6660 
caaaagcgca 6720 
atgttgttga 6780 
tcgtcccgcc 6840 
tttttaactt 6900 
acacaggcaa 6960 
acgctagtcc 7020 
aaaaccatcc 7080 
gacccacgcg 7140 
gacgatgatt 7200 
gccccggatc 7260 
tacagagatt 7320 
actgattcta 7380
gtggtggaat 7440 
tgatgaggct 7500
agaccccaag 7560 
tagaactctt 7620 
gaaaattatg 7680 
catactgttt 7740 
aaaattgtgt 7800
tagtgccttg 7860 
taaaaaacct 7920 
ttaacttgtt 7980 
caaataaagc 8040 
cttatcatgt 8100
acttgagagg 8160
aggtcactta 8220
attttaaaat 8280 
ccacaaatgt 8340 
tgctcatcaa 8400 
tccccacctg 8460 
cagcactcca 8520 
tgctgactgt 8580 
tagtttgcta 8640 
tttgaccctt 8700 
aagtttaaca 8760 
aaaatatttc 8820 
agggcctcgt 8880 
cgtcaggtgg 8940 
tacattcaaa 9000 
gaaaaaggaa 9060 
cattttgcct 9120 
atcagttggg 9180 
agagttttcg 9240 
gcgcggtatt 9300
ctcagaatga 9360 
cagtaagaga 9420 
ttctgacaac 9480 
atgtaactcg 9540 
gtgacaccac 9600 
tacttactct 9660 
gaccacttct 9720 
gtgagcgtgg 9780 
tcgtagttat 9840
ctgagatagg 9900 
tgcctcactg
tgatttaaaa 
catgaccaaa 
gatcaaagga 
aaaaccaccg 
gaaggtaact 
gttaggccac 
gttaccagtg 
atagttaccg 
cttggagcga 
cacgcttccc 
agagcgcacg 
tcgccacctc 
gaaaaacgcc 
catgttcttt 
agctgatacc 
ggaagagcgc 
accgcctcag 
tcctccccct 
caatgcgatg 
gtcaaggaag 
gaggctgatc 
atgctcgagc 
gcaaatattt 
gcacttggag 
ggtttagttt 
ccgtgagatt 
gtgccttcag 
tttagtagca 
gatattggag 
ccacattttg 
ggagatggag 
atggcacctg 
gaatcaaatt 
ggtgaaccaa 
aaccccttgg 
aagtaaaggc 
ccctgtccta 
ataaggcgtc 
ccttgtgaat 
gtcacacctg 
gccccgtact 
cccaagctac 
tcggtggtgg 
ggtccttggg 
atggtgagtg 
gaggtaactg 
ggtgggctca 
ccgttgccca 
aaagagagta 
agaaaaggca 
tcttcagacg 
catccgccgt 
caaccccgac 
ctggttagac 
cctcggtggc 
accgcgaaga 
tcggtaccaa 
agtgggttct 
accgcccatt 
ttggtgccaa 
gtcaaaccgc 



attaagcatt 
cttcattttt 
atcccttaac 
tcttcttgag 
ctaccagcgg 
ggcttcagca 
cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 
agggagcttc 
tgacttgagc 
agcaacgcgg 
cctgcgttat 
gctcgccgca 
ctgatgcggt 
aagccataga 
tgctgtcctg 
caatttcctc 
gcacggggga 
agcgagctct 
ggccgcttct 
cattaatgta 
ttgtgtctcc 
tgtctccgtt 
ttggataagc 
taagatctcc 
ctccattttc 
ccaaactgcc 
ttaagaccaa 
ctggtgtggt 
tgctgtcaaa 
ctaggccatg 
attcaagccc 
cagtgcttag 
ctttgttgta 
gtcttagatt 
tgttttgaga 
caaaacccaa 
gaccagttgc 
ttagtcctag 
cagtggcagt 
tgagggggcc 
tggcaatgct 
tgttgcccgc 
tgaggggtgc 
cagtggttac 
ttttgagcgc 
ccccaggggg 
cagttggagg 
gtcttgcgcg 
ctcaagaccg 
cgccacccgc 
gcctttctcg 
ggagtaccgt 
gtttgtcctc 
gcttgggtct 
ctagttagcc 
tgcgtcaatg 
aacaaactcc 
tatccacgcc 



ggtaactgtc 
aatttaaaag 
gtgagttttc 
atcctttttt 
tggtttgttt 
gagcgcagat 
actctgtagc 
gtggcgataa 
agcggtcggg 
ccgaactgag 
aggcggacag 
cagggggaaa 
gtcgattttt 
cctttttacg 
cccctgattc 
gccgaacgac 
attttctcct 
gcccaccgca 
ccccacccca 
attttattag 
ggggcaaaca
agcatttagg 
ttattcttgg 
gttgtggcca 
tgtttcctgt 
taagtaaact 
tgataggtta 
atttctaaag 
gtcaaatctt 
tttaacagcc 
agtgagttta 
ccacaaagtt 
actaaggcca 
gccaattttt 
atctcctgca 
gttaacctca 
gttaatatcc 
tagttggttt 
atcaatcctt 
ggctccagta 
tacggtcaaa 
ttttccattt 
agttagaggg 
tgatgtttgc 
aagtttggag 
gaccattaga 
agatatttcc 
attttgggag 
aagcatgcca 
actctcttga 
accggtttcc 
cttcatcttg 
cctactttaa 
tgccgcccgc 
agaggttttc 
tcggaggccg 
aaccgcgagc 
ccctatagtg 
agagagctct 
gggcggagtt 
cattgacgtc 
cattgatgta 



agaccaagtt 
gatctaggtg 
gttccactga 
tctgcgcgta 
gccggatcaa 
accaaatact 
accgcctaca 
gtcgtgtctt 
ctgaacgggg 
atacctacag 
gtatccggta 
cgcctggtat 
gtgatgctcg 
gttcctggcc 
tgtggataac 
cgagcgcagc 
tacgcatctg 
tccccagcat 
ccccccagaa 
gaaaggacag 
acagatggct 
tgacactata 
gcaatgtatg 
gaccagtccc 
gtaccgttta 
tgactgacaa 
ggcataaatc 
ttccaatatt 
ataataagat 
aaaactgaaa 
gcatctttct 
agcttatcat 
gttcctagtt 
gttttgaggg 
ttaatggcta 
agctttttgg 
aagttgtggg 
tgagcatcaa 
agtcctcctg 

actttagttt 
gtgtttaggt 
tgtgtataaa 
ggtgaggcag 
agggctagct 
tcgtgcacgg 
ggtgcggcgg 
aggtttatgt 
gtaaggttgc 
ttggaggtaa 
aacccattgg 
gtgtcatatg 
gatctcaagc 
ttacatcatc 
cacggtgctc 
cgatccggtc 
acgggtttcc 
ccaacagcga 
agtcgtatta 
gcttatatag 
gttacgacat 
aatggggtgg 
ctgccaaaac 



tactcatata 
aagatccttt 
gcgtcagacc 
atctgctgct 
gagctaccaa 
gtccttctag 
tacctcgctc 
accgggttgg 
ggttcgtgca 
cgtgagctat 
agcggcaggg 
ctttatagtc 
tcaggggggc 
ttttgctggc 
cgtattaccg 
gagtcagtga 
tgcggtattt 
gcctgctatt 
tagaatgaca 
tgggagtggc 
ggcaactaga 
gaatagggcc 
aaaaagtgta 
atgaaaatga 
gtgtaatggt 
tgttactttt 
caacagcgtt 
ctgggtccag 
gagcactttg 
ctgtagcaag 
ctgcatttag 
tatttttgtt 
taggaaccat 
gatttgtgtt 
tggctgtagc 
aattgtttga 
ctgagtttat 
acggataact
ctacattaag 
gcaaggaagt 
cgtctgttac 
tgggctcttt 
tgatagtaag 
ttccttctga 
ttagcggggc 
cagccacagt 
ttgacttggt 
cggcctcgtc 
ctagaggttc 
gggatacaaa 
gatacacggg 
ctgccacacc 
agcagcacct 
agcctacctt 
gatgcggact 
gatccaagag 
gctcgaattc 
atttcgataa 
acctcccacc 
tttggaaagt 
agacttggaa 
cgcatcacca 



tactttagat 
ttgataatct 
ccgtagaaaa 
tgcaaacaaa 
ctctttttcc 
tgtagccgta 
tgctaatcct 
actcaagacg 
cacagcccag 
gagaaagcgc 
tcggaacagg 
ctgtcgggtt 
ggagcctatg 
cttttgctca 
cctttgagtg 
gcgaggaagc 
cacaccgcat 
gtcttcccaa 
cctactcaga 
accttccagg 
aggcacagtc 
ctctagatgc 
agaggatgtg 
catagagtat 
tagtgttaca 
ggcagtttta 
tgtataggct 
gaaggaattg 
aactgttcca 
tatttgactg 
tctacagtta 
tcctactgta 
agccttgttt 
tggtgcatta 
gtcaaacatc 
agctgtaaac 
aaaaagaggg 
aacatcaagt 
ttgcatattg 
attattaata 
atgcaaagga 
caagtcaatg 
ggtactgcta 
cactgtgagg 
ctgtgattgc 
tagggcttct 
ttttttgaga 
cagagagagg 
ggataggcgc 
gggaggagta 
gttgaaggta 
tcacctcgac 
ccgccagaaa 
gcgactgtga 
cgctcaggtc 
tactggaaag 
agatccgagc 
gccagtaagc 
gtacacgcct 
cccgttgatt 
atccccgtga 
tggtaatagc 



9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
gatgactaat

ataatgccag 

atacacttga 

atggaaagtc 

ggggtcgttg 

catatatggg 

tcaatgtcaa 

aagcagattc 

cgcgcactac 

ggcatcagag 

tctatggtgc 

ccgctatcgc 

gcgccctgac 

gggagctgca 



acgtagatgt 

gcgggccatt 

tgtactgcca 

cctattggcg 

ggcggtcagc 

ctatgaacta 

cgcgtatatc 

ttcatgcaat 

tcagcgacct 

cagattgtac 

actctcagta 

tacgtgactg 

gggcttgtct 

tgtgtcagag 



actgccaagt 

taccgtcatt 

agtgggcagt 

ttactatggg 

caggcgggcc 

atgaccccgt 

tggcccgtac 

tgtcggtcaa 

ccaacacaca 

tgagagtcga 

caatctgctc 

ggtcatggct 

gctcccggca 

gttttcaccg 



aggaaagtcc 

gacgtcaata 

ttaccgtaaa 

aacatacgtc 

atttaccgta 

aattgattac 

atcgcgaagc 

gccttgcctt 

agcagggagc 

ccatagggga 

tgatgccgca 

gcgccccgac 

tccgcttaca 

tcatcaccga 



cataaggtca 

gggggcgtac

tagtccaccc 

attattgacg 

agttatgtaa 

tattaataac 

agcgcaaaac 

gttgtagctt 

agatactggc 

tcgggagatc 

tagttaagcc 

acccgccaac 

gacaagctgt 

aacgcgcgag 



tgtactgggc 

ttggcatatg 

attgacgtca 

tcaatgggcg 

cgcggaactc 

tagtcaataa 

gcctaaccct 

aaattttgct 

ttaactatgc 

tcccgatccg 

agtatacact 

acccgctgac 

gaccgtctcc 

gcagc 



13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14455 



<210> 16 
<211> 10610 
<212> DNA

<213> Artificial Sequence



<223> Description of Artificial Sequence: plasmid 



<400> 16 

gacggatcgg 

tctatgtcat 

tcctcttaca 

cgtgtttatt 

accaccacat 

caacctgcca 

aaaaagcatc 

ctgtcgagcc 

catgtcgctg 

cggcgaagga 

gcggtggtgc 

atacaacatg 

tgtcctccgg 

cagcaccaca 

ggggaecraca 

cctcataaac 

ccggtaccat 

ggccaaaacc 

gagagcccag 

acacaggcac 

atcccaggga 

cacgtaactc 

cagtatggta 

gcgccgagac 

agtcatattt 

ctcgccgctt 

ggcgccccct 

ccaccaccgc 

cgggaggagc 

aaaacctcaa 

tctacagcca 

caaacggccc 

ataaacattc 

atatctctaa 

ccctccacct 

cctgtataag 

gcagggccag 



gagatccgcg 

tttcatggga 

ctttttcata 

tttcaattgc 

agcttataca 

cctccctccc 

atatcatggg 

aaacgctcat 

tccagctgct 

gaagtccacg 

tgcagcagcg 

gcagtggtct 

gcacagcagc 

atattgttca 

gaacccacgt 

acgctggaca 

ataaacctct

tgcccgccgg 

gactcgtaac 

acgtgcatac 

acaacccatt 

acgttgtgca 

gcgcgggttt 

aaccgagatc 

cctgaagcaa 

agatcgctct 

ggcttcgggt 

agaataagcc 

gggaagagct 

aatgaagatc 

aagaacagat 

tcacgtccaa 

cagcaccttc 

gcaaatcccg 

tcagcctcaa 

attcaaaagc 

ctgaacataa 



cggtacacag 

ctggtctggc 

cattgcccaa 

agaaaatttc

gatcaccgta 

aacacacaga 

taacagacat 

cagtgatatt 

gagccacagg 

cctacatggg 

cgcgaataaa 

cctcagcgat 

gcaccctgat 

aaatcccaca 

ggccatcata 

taaacattac 

gattaaacat 

ctatacactg 

catggatcat 

acttcctcag 

cctgaatcag 

ttgtcaaagt 

ctgtctcaaa 

gtgttggtcg 

aaccaggtgc 

gtgtagtagt 

tctatgtaaa 

acacccagcc 

ggaagaacca 

tattaagtga 

aatggcattt 

gtggacgtaa 

aaccatgccc 

aatattaagt 

gcagcgaatc 

ggaacattaa 

tcgtgcaggt 



aattcaggag 

cacaactaca 

gaataaagaa 

aagtcatttt 

ccttaatcaa 

gtacacagtc 

attcttaggt 

aataaactcc 

ctgctgtcca 

ggtagagtca 

ctgctgccgc 

gattcgcacc 

ctcacttaaa 

gtgcaaggcg 

ccacaagcgc 

ctcttttggc 

ggcgccatcc 

cagggaaccg 

catgctcgtc 

gattacaagc 

cgtaaatccc 

gttacattcg 

aggaggtaga 

tagtgtcatg 

gggcgtgaca 

tgtagtatat 

ctccttcatg 

aacctacaca 

tgtttttttt 

acgcgctccc 

gtaagatgtt 

aggctaaacc 

aaataattct 

ccggccattg 

atgattgcaa 

caaaaatacc 

ctgcacggac 



acacaactcc 

ttaatgaaat 

tcgtttgtgt 

tcattcagta 

actcacagaa 

ctttctcccc 

gttatattcc 

ccgggcagct 

acttgcggtt 

taatcgtgca 

cgccgctccg 

gcccgcagca 

tcagcacagt 

ctgtatccaa 

aggtagatta 

atgttgtaat 

accaccatcc 

ggactggaac 

atgatatcaa 

tcctcccgcg 

acactgcagg 

ggcagcagcg 

cgatccctac 

ccaaatggaa 

aacagatctg 

ccactctctc 

cgccgctgcc 

ttcgttctgc 

tttattccaa 

ctccggtggc 

gcacaatggc 

cttcagggtg 

catctcgcca 

taaaaatctg 

aaattcaggt 

gcgatcccgt 

cagcgcggcc 



aagtgcatac 60 
atttgccaca 120 
tatgtttcaa 180 
gtatagcccc 240 
ccctagtatt 300 
ggctggcctt 360 
acacggtttc 420 
cacttaagtt 480 
gcttaacggg 540 
tcaggatagg 600 
tcctgcagga 660 
taaggcgcct 720 
aactgcagca 780 
agctcatggc 840 
agtggcgacc 900 
tcaccacctc 960 
taaaccagct 1020 
aatgacagtg 1080 
tgttggcaca 1140 
ttagaaccat 1200 
gaagacctcg 1260 
gatgatcctc 1320 
tgtacggagt 1380 
cgccggacgt 1440 
cgtctccggt 1500 
aaagcatcca 1560 
ctgataacat 1620 
gagtcacaca 1680 
aagattatcc 1740 
gtggtcaaac 1800 
ttccaaaagg 1860 
aatctcctct 1920 
ccttctcaat 1980 
ctccagagcg 2040 
tcctcacaga 2100 
aggtcccttc 2160 
acttccccgc 2220 
caggaacctt
ccagcgtagc 
ctcaaaaaat 
agataaaggc 

atgtctgcgg 

aagcctgtct 
cggcgtgacc 
tcatgtccgg 
gctaaaaagc 
acagccccca 
aaaccctcct 
cagcggcagc 
ctcgacacgg 
atatatagga 
cacgcgaacc 
ttccgttttc 
tcagtacaat 
tggaggtcgc 
acaattgcat 
ccagatatac 
cattagttca 
ctggctgacc 
taacgccaat 
acttggcagt 
gtaaatggcc 
agtacatcta 
atgggcgtgg 
atgggagttt 
ccccattgac 
tctggctaac 
gggagaccca
gttgaggaca 
cgaacggtac 
tctcgagaaa 
gcagcgggtg 
aggcggtctt 
aagcgcgcaa 
ggtcctccaa 
agtccccctg 
cttgcgctca 
aatgtaacca 
tctgcacccc 
gtcgcgggca 
cttagcattg 
tcaggccccc 
actactgcca 
aaactaggac 
gtagcaactg 
gccttgggtt 
gattctcaaa 
ctaaatctaa 
aactacaaca 
aacctaagca 
gatgggcttg 
ggccatggcc 
agttttgaca 
tggaccacac 
actttggtct 
aaaggcagtt 
gacgaaaatg 
aatggagatc 
tcagcttatc 



gacaaaagaa 
cccgatgtaa 
caggcaaagc 
aggtaagctc 
gtttctgcat 
tacaacagga 
gtaaaaaaac 
agtcataatg 
gaccgaaata 
taggaggtat 
gcctaggcaa 
ctaacagtca 
caccagctca 
ctaaaaaatg 
tacgcccaga 
ccacgttacg 
ctgctctgat 
tgagtagtgc 
gaagaatctg 
gcgttgacat 
tagcccatat
gcccaacgac 
agggactttc 
acatcaagtg 
cgcctggcat 
cgtattagtc 
atagcggttt 
gttttggcac 
gcaaatgggc 
tagagaaccc 
agcttggtac 
aactcttcgc 
tccgccaccg 
ggcgtctaac 
gcggtcgggg 
gagacggcgg
gaccgtctga 
ctgtgccttt 
gggtactctc 
aaatgggcaa 
ctgtgagccc 
tcacagttac 
acacactcac 
ccacccaagg 
tcaccaccac 
ctggtagctt 
taaagtacgg 
gtccaggtgt 
ttgattcaca 
acagacgcct 
gactaggaca 
aaggccttta 

ctgccaaggg 
aatttggttc 
tagaatttga 
gcacaggtgc 
cagctccatc 
taacaaaatg 
tggctccaat 
gagtgctact 
ttactgaagg 
caaaatctca 



cccacactga 
gctttgttgc 
ctcgcgcaaa 
cggaaccacc 
aaacacaaaa 
aaaacaaccc 
tggtcaccgt 
taagactcgg 
gcccggggga 
aacaaaatta 
aatagcaccc 
gccttaccag 
atcagtcaca 
acgtaacggt
aacgaaagcc 
taacttcccg 
gccgcatagt 
gcgagcaaaa 
cttagggtta 
tgattattga 
atggagttcc 
ccccgcccat 
cattgacgtc
tatcatatgc 
tatgcccagt 
atcgctatta
gactcacggg 

caaaatcaac 
ggtaggcgtg 
actgcttact 
cgagctcgga
ggtctttcca 
agggacctga 
cagtcacagt 
ttgtttctgg 
atggtcgagg 
agataccttc
tcttactcct 
tttgcgccta 
cggcctctct 
acctctcaaa 
ctcagaagcc 
catgcaatca 
acccctcaca 
cgatagcagt
gggcattgac
ggctcctttg 
gactattaat 
aggcaatatg 
tatacttgat
gggccctctt 

cttgtttaca 
gttgatgttt 
acctaatgca 
ttcaaacaag 
cattacagta 
tcctaactgt 
tggcagtcaa 
atctggaaca 
aaacaattcc 
cacagcctat 
cggtaaaact 



ttatgacacg 
atgggcggcg 
aaagaaagca 
acagaaaaag 
taaaataaca 
ttataagcat
gattaaaaag 
taaacacatc 
atacataccc 
ataggagaga 
tcccgctcca 
taaaaaagaa 
gtgtaaaaaa 
taaagtccac 
aaaaaaccca 
gatcctctcc 
taagccagta 
tttaagctac 
ggcgttttgc 
ctagttatta 
gcgttacata
tgacgtcaat 
aatgggtgga 
caagtacgcc 
acatgacctt 
ccatggtgat 
gatttccaag 
gggactttcc 
tacggtggga 
ggcttatcga
tctgaattcg
gtactcttgg 
gcgagtccgc 
cgcaaggtag 
cggaggtgct 
tgaggtgtgg 
aaccccgtgt 
ccctttgtat 
tccgaacctc 
ctggacgagg 
aaaaccaagt 
ctaactgtgg 
caggccccgc 
gtgtcagaag 
acccttacta 
ttgaaagagc 
catgtaacag 
aatacttcct 
caacttaatg 
gttagttatc 
tttataaact 
gcttcaaaca 
gacgctacag 
ccaaacacaa 
gctatggttc 
ggaaacaaaa 
agactaaatg 
atacttgcta
gttcaaagtg 
ttcctggacc 
acaaacgctg 
gccaaaagta 



catactcgga gctatgctaa 2280
atataaaatg caaggtgctg 2340
catcgtagtc atgctcatgc 2400 
acaccatttt tctctcaaac 2460 
aaaaaacatt taaacattag 2520 
aagacggact acggccatgc 2580 
caccaccgac agctcctcgg 2640 
aggttgattc atcggtcagt 2700 
gcaggcgtag agacaacatt 2760 
aaaacacata aacacctgaa 2820 
gaacaacata cagcgcttca 2880 
aacctattaa aaaaacacca 2940 
gggccaagtg cagagcgagt 3000 
aaaaaacacc cagaaaaccg 3060 
caacttcctc aaatcgtcac 3120
cgatccccta tggtcgactc 3180 
tctgctccct gcttgtgtgt 3240 
aacaaggcaa ggcttgaccg 3300 
gctgcttcgc gatgtacggg 3360 
atagtaatca attacggggt 3420 
acttacggta aatggcccgc 3480 
aatgacgtat gttcccatag 3540
ctatttacgg taaactgccc 3600
ccctattgac gtcaatgacg 3660 
atgggacttt cctacttggc 3720 
gcggttttgg cagtacatca 3780 
tctccacccc attgacgtca 3840
aaaatgtcgt aacaactccg 3900 
ggtctatata agcagagctc 3960
aattaatacg actcactata 4020
agctcgctgt tgggctcgcg 4080 
atcggaaacc cgtcggcctc 4140 
atcgaccgga tcggaaaacc 4200 
gctgagcacc gtggcgggcg 4260 
gctgatgatg taattaaagt 4320 
caggcttgag atccaagatg 4380
atccatatga cacggaaacc 4440 
cccccaatgg gtttcaagag 4500 
tagttacctc caatggcatg 4560 
ccggcaacct tacctcccaa 4620 
caaacataaa cctggaaata 4680 
ctgccgccgc acctctaatg 4740 
taaccgtgca cgactccaaa 4800 
gaaagctagc cctgcaaaca 4860 
tcactgcctc accccctcta 4920 
ccatttatac acaaaatgga 4980 
acgacctaaa cactttgacc 5040 
tgcaaactaa agttactgga 5100 
tagcaggagg actaaggatt 5160
cctttgatgc tcaaaaccaa 5220
cagcccacaa cttggatatt 5280
attccaaaaa gcttgaggtt 5340 
ccatagccat taatgcagga 5400 
atcccctcaa aacaaaaatt 5460 
ctaaactagg aactggcctt 5520 
ataatgataa gctaactttg 5580 
cagagaaaga tgctaaactc 5640 
cagtttcagt tttggctgtt 5700 
ctcatcttat tataagattt 5760 
cagaatattg gaactttaga 5820
ttggatttat gcctaaccta 5880 
acattgtcag tcaagtttac 5940 
ttaaacggag
acaggagaca 
aactacatta 
taaagaagcg 
gctagagctc 
ccctcccccg 
aatgaggaaa 
gggcaggaca 
ggctctatgg 
ccctgtagcg 
cttgccagcg 
gccggctttc 
ttacggcacc 
ccctgataga 
ttgttccaaa 
attttgggga 
aattaattct 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 
gcttgtatat 
gaacaagatg 
gactgggcac 
gggcgcccgg 
gaggcagcgc 
gttgtcactg 
ctgtcatctc 
ctgcatacgc 
cgagcacgta 
caggggctcg 
gatctcgtcg 
ttttctggat 
ttggctaccc 
ctttacggta 
ttcttctgag 
cacgagattt 
gggacgccgg 
ccaacttgtt 
caaataaagc 
cttatcatgt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 



acaaaactaa 
caactccaag 
atgaaatatt 
gccgctcgag 
gctgatcagc 
tgccttcctt 
ttgcatcgca 
gcaaggggga 
cttctgaggc 
gcgcattaag 
ccctagcgcc 
cccgtcaagc 
tcgaccccaa 
cggtttttcg 
ctggaacaac 
tttcggccta 
gtggaatgtg 
tgcaaagcat 
caggcagaag 
ctccgcccat 
taattttttt 
agtgaggagg 
ccattttcgg 
gattgcacgc 
aacagacaat 
ttctttttgt 
ggctatcgtg 
aagcgggaag 
accttgctcc 
ttgatccggc 
ctcggatgga 
cgccagccga 
tgacccatgg 
tcatcgactg 
gtgatattgc 
tcgccgctcc 
cgggactctg 
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 



acctgtaaca 
tgcatactct 
tgccacatcc 
catgcatcta 
ctcgactgtg 
gaccctggaa 
ttgtctgagt 
ggattgggaa 
ggaaagaacc 
cgcggcgggt 
cgctcctttc 
tctaaatcgg 

aaaacttgat
ccctttgacg 
actcaaccct 
ttggttaaaa 
tgtcagttag 
gcatctcaat 
tatgcaaagc 
cccgccccta 
tatttatgca 
cttttttgga 
atctgatcaa 
aggttctccg 
cggctgctct 
caagaccgac 
gctggccacg 
ggactggctg 
tgccgagaaa 
tacctgccca 
agccggtctt 
actgttcgcc 
cgatgcctgc 
tggccggctg 
tgaagagctt 
cgattcgcag 
gggttcgaaa 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 



ctaaccatta 
atgtcatttt 
tcttacactt 

gagggcccta

ccttctagtt 
ggtgccactc 
aggtgtcatt 
gacaatagca 
agctggggct 
gtggtggtta 
gctttcttcc
ggcatccctt 
tagggtgatg 
ttggagtcca 
atctcggtct 
aatgagctga 
ggtgtggaaa 
tagtcagcaa 
atgcatctca 
actccgccca 
gaggccgagg 
ggcctaggct 
gagacaggat 

gccgcttggg 
gatgccgccg 
ctgtccggtg 
acgggcgttc 
ctattgggcg 
gtatccatca 
ttcgaccacc 
gtcgatcagg 
aggctcaagg 
ttgccgaata 
ggtgtggcgg 
ggcggcgaat 
cgcatcgcct 
tgaccgacca 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 

gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 



cactaaacgg 
catgggactg 
tttcatacat 
ttctatagtg 
gccagccatc 
ccactgtcct 

ctattctggg 
ggcatgctgg 
ctagggggta 
cgcgcagcgt 
cttcctttct 
tagggttccg 
gttcacgtag 
cgttctttaa 
attcttttga 
tttaacaaaa 
gtccccaggc 
ccaggtgtgg 
attagtcagc 
gttccgccca 
ccgcctctgc 
tttgcaaaaa 
gaggatcgtt 
tggagaggct 
tgttccggct 
ccctgaatga 
cttgcgcagc 
aagtgccggg 
tggctgatgc 
aagcgaaaca 
atgatctgga 
cgcgcatgcc 
tcatggtgga 
accgctatca 
gggctgaccg 
tctatcgcct 
agcgacgccc 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcaa 
gggctgtgtg 
tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 

tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 



tacacaggaa 6000 
gtctggccac 6060 
tgcccaagaa 6120 
tcacctaaat 6180 
tgttgtttgc 6240 
ttcctaataa 6300 
gggtggggtg 6360 
ggatgcggtg 6420 
tccccacgcg 6480 
gaccgctaca 6540 
cgccacgttc 6600 
atttagtgct 6660 
tgggccatcg 6720 
tagtggactc 6780 
tttataaggg 6840 
atttaacgcg 6900 
tccccaggca 6960 
aaagtcccca 7020 
aaccatagtc 7080 
ttctccgccc 7140 
ctctgagcta 7200 
gctcccggga 7260 
tcgcatgatt 7320 
attcggctat 7380 
gtcagcgcag 7440 
actgcaggac 7500 
tgtgctcgac 7560 
gcaggatctc 7620 
aatgcggcgg 7680 
tcgcatcgag 7740 
cgaagagcat 7800 
cgacggcgag 7860 
aaatggccgc 7920 
ggacatagcg 7980 
cttcctcgtg 8040 
tcttgacgag 8100 
aacctgccat 8160 
atcgttttcc 8220 
ttcgcccacc 8280 
acaaatttca 8340 
atcaatgtat 8400 
tggtcatagc 8460 
gccggaagca 8520 
gcgttgcgct 8580 
atcggccaac 8640 
actgactcgc 8700 
gtaatacggt 8760 
cagcaaaagg 8820 
ccccctgacg 8880 
ctataaagat 8940 
ctgccgctta 9000 
tgctcacgct 9060 
cacgaacccc 9120 
aacccggtaa 9180 
gcgaggtatg 9240
agaaggacag 9300 
ggtagctctt 9360 
cagcagatta 9420 
tctgacgctc 9480 
aggatcttca 9540 
tatgagtaaa 9600 



atctgtctat 9660 
ttcgttcatc

taccatctgg 

tatcagcaat 

ccgcctccat 

atagtttgcg 

gtatggcttc 

tgtgcaaaaa 

cagtgttatc 

taagatgctt 

ggcgaccgag 

ctttaaaagt 

cgctgttgag 

ttactttcac 

gaataagggc 

gcatttatca 

aacaaatagg 



catagttgcc 

ccccagtgct 

aaaccagcca 

ccagtctatt 

caacgttgtt 

attcagctcc 

agcggttagc 

actcatggtt 

ttctgtgact 

ttgctcttgc 

gctcatcatt 

atccagttcg 

cagcgtttct 

gacacggaaa 

gggttattgt 

ggttccgcgc 



tgactccccg 

gcaatgatac 

gccggaaggg 

aattgttgcc 

gccattgcta 

ggttcccaac 

tccttcggtc 

atggcagcac 

ggtgagtact 

ccggcgtcaa 

ggaaaacgtt 

atgtaaccca 

gggtgagcaa 

tgttgaatac 

ctcatgagcg 

acatttcccc 



tcgtgtagat 

cgcgagaccc 

ccgagcgcag 

gggaagctag 

caggcatcgt 

gatcaaggcg 

ctccgatcgt 

tgcataattc 

caaccaagtc 

tacgggataa 

cttcggggcg 

ctcgtgcacc 

aaacaggaag 

tcatactctt 

gatacatatt 

gaaaagtgcc 



aactacgata 

acgctcaccg 

aagtggtcct 

agtaagtagt 

ggtgtcacgc 

agttacatga 

tgtcagaagt 

tcttactgtc 

attctgagaa 

taccgcgcca 

aaaactctca 

caactgatct 

gcaaaatgcc 

cctttttcaa

tgaatgtatt 

acctgacgtc 



cgggagggct 

gctccagatt 

gcaactttat 

tcgccagtta 

tcgtcgtttg 

tcccccatgt 

aagttggccg 

atgccatccg 

tagtgtatgc 

catagcagaa 

aggatcttac 

tcagcatctt 

gcaaaaaagg 

tattattgaa 

tagaaaaata 



9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10610 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer 

<400> 17

tgtacaccgg atccggcgca cacc 



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Primer

<400> 18
cacaacgagc tcaattaatt aattgccaca tcctc

<210> 19 
<211> 4 
<212> PRT 
<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 

<210> 20
<211> 12
<212> PRT
<213> adenovirus

<400> 20
Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser
1 5 10 



<210> 22
<211> 327 
<212> DNA 
<213> adenovirus



<400> 22 

agatctgaat 

ccagtactct 

tgagcgagtc 

agtcgcaagg 

tggcggaggt 

aggtgaggtg 



tcgagctcgc 

tggatcggaa 

cgcatcgacc 

taggctgagc 

gctgctgatg 

tggcaggctt 



tgttgggctc 

acccgtcggc 

ggatcggaaa 

accgtggcgg 

atgtaattaa 

gagatct 



gcggttgagg 

ctccgaacgg 

acctctcgag 

gcggcagcgg 

agtaggcggt 



acaaactctt 

tactccgcca 

aaaggcgtct 

gtggcggtcg 

cttgagacgg 



cgcggtcttt 60 
ccgagggacc 120 
aaccagtcac 180 
gggttgtttc 240 
cggatggtcg 300 
327 



<210> 23 
<211> 32480
<212> DNA 
<213> adenovirus 



<400> 23 
catcatcaat 
ttgtgacgtg 
gatgttgcaa 
gtgtgcgccg 
taaatttggg 
agtgaaatct 
aagcttgaat 
tatgcttccg 
cagctatgac 
aggaacttgg 
agggtgtgga 
ttagtcagca 
catgcatctc 
aactccgccc 
agaggccgag 
aggcctaggc 
tcgaacactt 
atcgtcacct 
tctgaacaat 
gaccagaaac 
ggcgagatcg 
cttaatcgcc 
accgatcgcc 
ccggcaccag 
gtcgtcgtcc 
acctatccca 
tcgctcacat 
gatggcgtta 
gacagtcgtt 
ctcgcggtga 
cggatgagcg 
gatttccatg 
gttcagatgt 
gaaacgcagg 
ggttatgccg 
gaaatcccga 
gaagcagaag 
ctgaacggca 
ggtcaggtca 
tttaacgccg 
cgctacggcc 
aatcgtctga 
gtgcagcgcg 
cacggcgcta 

gtgcagtatg 

gcgcgcgtgg 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
tcgattaatg 
gctcgtatgt 
catgattacg 
ttaggtacct 
aagtccccag 
accaggtgtg 
aattagtcag 
agttccgccc
gccgcctcgg 
ttttgcaaaa 
tttaagccgt 
gggacatgtt 
ggaaaggcat 
agcacctcga 
atcccgtcgt 
ttgcagcaca 
cttcccaaca 
aagcggtgcc 
cctcaaactg 
ttacggtcaa 
ttaatgttga 
actcggcgtt 
tgccgtctga 
tggtgctgcg 
gcattttccg 
ttgccactcg 
gcggcgagtt 
tcgccagcgg 
atcgcgtcac 
atctctatcg 
cctgcgatgt 
agccgttgct 
tggatgagca 
tgcgctgttc 
tgtatgtggt 
ccgatgatcc 
atcgtaatca 
atcacgacgc 
aaggcggcgg 
atgaagacca 



attttggatt 

tgggaacggg 

acacatgtaa 

gaagtgacaa 

taagatttgg 

gtgttactca 

tgagttagct 

tgtgtggaat 

aattcggcgc 

tctgaggcgg 

gctccccagc 

gaaagtcccc 

caaccatagt 

attctccgcc 

cctctgagct 

agcttgggat 

agataaacag 

gcagatccat 

tattgccgta 

actgagccgc 

tttacaacgt 

tccccctttc 

gttgcgcagc 

ggaaagctgg 

gcagatgcac 

tccgccgttt 

tgaaagctgg 

tcatctgtgg 

atttgacctg 

ttggagtgac 

tgacgtctcg 

ctttaatgat 

gcgtgactac 

caccgcgcct 

actacgtctg 

tgcggtggtt 

cggtttccgc 

gattcgaggc 

gacgatggtg 

gcattatccg 

ggatgaagcc 

gcgctggcta 

cccgagtgtg 

gctgtatcgc 

agccgacacc 

gcccttcccg 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
cactcattag 
tgtgagcgga 
agcaccatgg 
aaagaaccag 
aggcagaagt 
aggctcccca 
cccgccccta 
ccatggctga 
attccagaag 
ctctataatc 
gctgggacac 
gcacgtaaac 
agccgtggcg 
gatattgccc 
cgtgactggg 
gccagctggc 
ctgaatggcg 
ctggagtgcg 
ggttacgatg 
gttcccacgg 
ctacaggaag 
tgcaacgggc 
agcgcatttt 
ggcagttatc 
ttgctgcata 
gatttcagcc 
ctacgggtaa 
ttcggcggtg 
aacgtcgaaa 
gaactgcaca 
gaggtgcgga 
gttaaccgtc 
caggatatcc 
aaccatccgc 
aatattgaaa 
ccggcgatga 
atcatctggt 
tggatcaaat 
acggccaccg 
gctgtgccga 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tctctagcat 
gcaccccagg 
taacaatttc 
cctgaaataa 
ctgtggaatg 
atgcaaagca 
gcaggcagaa 
actccgccca 
ctaatttttt 
tagtgaggag 
tcgcgcaacc 
ttcacatgag 
tcgcaagccg 
gtctggtacc 
agcgtttcaa 
aaaaccctgg 
gtaatagcga 
aatggcgctt 
atcttcctga 
cgcccatcta 
agaatccgac 
gccagacgcg 
gctgggtcgg 
tacgcgccgg 
tggaagatca 
aaccgactac 
gcgctgtact 
cagtttcttt 
aaattatcga 
acccgaaact 
ccgccgacgg
ttgaaaatgg 
acgagcatca 
tgctgatgaa 
tgtggtacac 
cccacggcat 
gcgaacgcgt 
cgctggggaa 
ctgtcgatcc 
atattatttg 
aatggtccat 



ggggtggagt 60
gcggaagtgt 120 
gacgtttttg 180 
gatgttgtag 240 
aataagagga 300 
cgatgtcgac 360 
ctttacactt 420 
acacaggaaa 480 
cctctgaaag 540 
tgtgtcagtt 600 
tgcatctcaa 660 



gtatgcaaag 720
tcccgcccct 780
ttatttatgc 840
gcttttttgg 900
tattttcccc 960
cgaaaaatac 1020
actgatgcct 1080
gg^SSS^gaa 1140 
cgcgctgtat 1200 
cgttacccaa 1260 
agaggcccgc 1320 
tgcctggttt 1380 
ggccgatact 1440 
caccaacgta 1500 
gggttgttac 1560 
aattattttt 1620 
ttacggccag 1680 
agaaaaccgc 1740 
ggatatgtgg 1800 
acaaatcagc 1860 
ggaggctgaa 1920
atggcagggt 1980 
tgagcgtggt 2040 
gtggagcgcc 2100 
cacgctgatt 2160 
tctgctgctg 2220 
tcctctgcat 2280 
gcagaacaac 2340 
gctgtgcgac 2400 
ggtgccaatg 2460 
aacgcgaatg 2520 
tgaatcaggc 2580 
ttcccgcccg 2640 
cccgatgtac 2700 
caaaaaatgg 2760 
ctttcgctac
aacagtcttg 
ggcggcttcg 
ccgtggtcgg 
aacggtctgg 
cagcagtttt 
ttccgtcata 
gcaagcggtg 
gaactaccgc 
aacgcgaccg 
gaaaacctca 
gaaatggatt 
tttctttcac 
ttcacccgtg 
aacgcctggg 
cagtgcacgg 
catcagggga 
atggcgatta 
ctgaactgcc 
gaaaactatc 
gacatgtata 
ttgaattatg 
caacagcaac 
ctgaatatcg 
tcggcggaat 
taataataac 
tatttaaaaa 
atgggagcct 
agtgatacgg 
tttggtctgc 
aaagcaatag 
gtttgtccaa 
agggtgggaa 
gccgccgccg 
acgcgcatgc 
cgccccgtcc 
ttggagactg 
actgactttg 
gatgacaagt 
gtttctcagc 
cccaatgcgg 
gtgtcttgct 
cggtcgttga 
agatacatgg 
tgcggggtgg 
atgtctttca 
cggttaagct 
aggttggcta 
acagtgtatc 
aacttggaga 
atgggcccac 
tgttccagga 
tgcggtataa 
cacgctttga 
tccggggtag 
cagccggtgg 
cagctgccgt 
ttttccctga 
gaagcaaagt 
ccaagcagtt 
atatctcctc 
ccagacgggc 



ctggagagac 
gcggtttcgc
tctgggactg 
cttacggcgg 
tctttgccga 
tccagttccg 
gcgataacga 
aagtgcctct 
agccggagag 
catggtcaga 

gtgtoa-csct 
tttgcatcga 
agatgtggat 
caccgctgga 
tcgaacgctg 
cagatacact 
aaaccttatt 
ccgttgatgt 
agctggcgca 
ccgaccgcct 
ccccgtacgt 
gcccacacca 
tgatggaaac 
acggtttcca 
tccagctgag 
cgggcaggcc 
acacaaactt 
acttcccgtt 
gtattatttt 
tttctgacaa 
catcacaaat 
actcatcaat 
agaatatata 
ccatgagcac 
ccccatgggc 
tgcccgcaaa 
cagcctccgc 
ctttcctgag 
tgacggctct 
agctgttgga 
tttaaaacat 
gtctttattt 
gggtcctgtg 
gcataagccc 
tgttgtagat 
gtagcaagct 
gggatgggtg 
tgttcccagc 

cggtgcactt 
cgcccttgtg 
gggcggcggc 
tgagatcgtc 
tggttccatc 
gttcagatgg 
gggagatcag 
gcccgtaaat 
catccctgag 
ccaaatccgc 
ttttcaacgg 
ccaggcggtc 
gtttcgcggg 
cagggtcatg 



gcgcccgctg 
taaatactgg 
ggtggatcag 
tgattttggc 
ccgcacgccg 
tttatccggg 
gctcctgcac 
ggatgtcgct 
cgccgggcaa 
agccgggcac 
ccccgccgcg 
gctgggtaat 
tggcgataaa 
taacgacatt 
gaaggcggcg 
tgctgatgcg 
tatcagccgg 
tgaagtggcg 
ggtagcagag 
tactgccgcc 
cttcccgagc 
gtggcgcggc 
cagccatcgc 
tatggggatt 
cgccggtcgc 
atgtctgccc 
ttggatgttc 
tttcccgatt 
tgccgctatt 
actcggaact 
ttcacaaata 
gtatcttatc 
aggtgggggt 
caactcgttt 
cggggtgcgt 
ctctactacc 
cgccgcttca 
cccgcttgca 
tttggcacaa 
tctgcgccag 
aaataaaaaa 
aggggttttg 
tattttttcc 
gtctctgggg 
gatccagtcg 
gattgccagg 
catacgtggg 
catatccctc 
gggaaatttg 
acctccaaga 
ctgggcgaag 
ataggccatt 
cggcccaggg 
ggggatcatg 
ctgggaagaa 
cacacctatt 
caggggggcc 
cagaaggcgc 
tttgagaccg 
ccacagctcg 
ttggggcggc 
tctttccacg 



atcctttgcg 
caggcgtttc 
tcgctgatta 
gatacgccga 
catccagcgc 
caaaccatcg 
tggatggtgg 
ccacaaggta 
ctctggctca 
atcagcgcct 
tcccacgcca 
aagcgttggc 
aaacaactgc 
ggcgtaagtg 
ggccattacc 
gtgctgatta 
aaaacctacc 
agcgatacac 
cgggtaaact 
tgttttgacc 
gaaaacggtc 
gacttccagt 
catctgctgc 
ggtggcgacg 
taccattacc 
gtatttcgcg 
ggtttattct 
tggctacatg 
tctctgttct 
tgtttattgc 
aagcattttt 
atgtctggat 
cttatgtagt 
gatggaagca 
cagaatgtga 
ttgacctacg 
gccgctgcag 
agcagtgcag 
ttggattctt 
caggtttctg 
ccagactctg 
cgcgcgcggt 
aggacgtggt 
tggaggtagc 
tagcaggagc 
ggcaggccct 
gatatgagat 
cggggattca 
tcatgtagct 
ttttccatgc 
atatttctgg 
tttacaaagc 
gcgtagttac 
tctacctgcg 
agcaggttcc 
accgggtgca 
acttcgttaa 
tcgccgccca 
tccgccgtag 
gtcacctgct 
tttcgctgta 
ggcgcagggt 



aatacgccca 
gtcagtatcc 
aatatgatga 
acgatcgcca 
tgacggaagc 
aagtgaccag 
cgctggatgg 
aacagttgat 
cagtacgcgt 
ggcagcagtg 
tcccgcatct 
aatttaaccg 
tgacgccgct 
aagcgacccg 
aggccgaagc 
cgaccgctca 
ggattgatgg 
cgcatccggc 
ggctcggatt 
gctgggatct 
tgcgctgcgg 
tcaacatcag 
acgcggaaga 
actcctggag 
agttggtctg 
taaggaaatc 
ttttctttta 
acatcaacca 
cgctattatt 
agcttataat 
ttcactgcat 
ccagatctgg 
tttgtatctg 
ttgtgagctc 
tgggctccag 

agaccgtgtc 
ccaccgcccg 
cttcccgttc 
tgacccggga 
ccctgaaggc 
tttggatttg 
aggcccggga 
aaaggtgact 
accactgcag 
gctgggcgtg 
tggtgtaagt 
gcatcttgga 
tgttgtgcag 
tagaaggaaa 
attcgtccat 
gatcactaac 
gcgggcggag 
cctcacagat 
gggcgatgaa 
tgagcagctg 
actggtagtt 
gcatgtccct 
gcgatagcag 
gcatgctttt 
ctacggcatc 
cggcagtagt 
cctcgtcagc 



cgcgatgggt 
ccgtttacag 
aaacggcaac 
gttctgtatg 
aaaacaccag 
cgaatacctg 
taagccgctg 
tgaactgcct 
agtgcaaccg 
gcgtctggcg 
gaccaccagc 
ccagtcaggc 
gcgcgatcag 
cattgaccct 
agcgttgttg 
cgcgtggcag 
tagtggtcaa 
gcggattggc 
agggccgcaa 
gccattgtca 
gacgcgcgaa 
ccgctacagt 
aggcacatgg 
cccgtcagta 
gtgtcaaaaa 
cattatgtac 
cttttttatc 
tatcagcaaa 
ccaaccgctg 
ggttacaaat 
tctagttgtg 
gcgtggctta 
ttttgcagca 
atatttgaca 
cattgatggt 
tggaacgccg 
cgggattgtg 
atccgcccgc 
acttaatgtc 
ttcctcccct 
gatcaagcaa 
ccagcggtct 
ctggatgttc 
agcttcatgc 
gtgcctaaaa 
gtttacaaag 
ctgtattttt 
aaccaccagc 
tgcgtggaag 
aatgatggca 
gtcatagttg 
ggtgccagac
ttgcatttcc 
gaaaacggtt 
cgacttaccg 
aagagagctg 
gactcgcatg 
ttcttgcaag 
gagcgtttga 
tcgatccagc 
cggtgctcgt 
gtagtctggg 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
tcacggtgaa
tgctggtgct 
tggtgtcata 
aggcgccgca 
ccgattccgg 
gccaggtgag 
gtttcttacc 
tgtccccgta 
atagaaactc 
agtgggaggg 
acatgtcgcc 
cgggtgttcc 
ccgcatcgct 
tgacttctgc 
ccgcggtgat 
tgtcaagctt 
gcagggtttg 
attcgcgcgc 
gcacgcgcca 
gtaggcgctc 
ggtctagctg 
gcgcgtcgaa 
cggcaagcgc 
cggaggcgta 
atgtagggta 
agggagcgag 
tctgcctgaa 
tggcgtctgt 
tgaccagctc 
tgtcatactt 
ggtctttcca 
tgtagaactg 
cctgcgcggc 
tgaggtactg 
ccgtgcgctt 
ttcccgcgcg 
tgttaattac 
tgtaaagttc 
aggtgagctc 
ggttggaagc 
gaaaggtcct 
gcgggtcttg 
ctagaggctc 
aggcccccat 
gatgcgagcc 
tgtggtgaaa 
gtgcgcagta 
cgcgcacaag 
cttctacttc 
ggaccaccac 
tgacaacatc 
gcgggagctc 
gatacctaat 
cccgcggcgc 
atgcatctaa 
cgggagaggg 
taggttgctg 
gaagacgacg 
gtcgttgacg 
gatctcggcc 
cacggtggcg 
tccctcgttc 



ggggtgcgct 
gaagcgctgc 
gtccagcccc 
cgaggggcag 
ggagtaggca 
ctctggccgt 
tctggtttcc 
tacagacttg 
ggaccactct 
gtagcggtcg 
ctcttcggca 
tgaagggggg 
gtctgcgagg 
gctaagattg 
gcctttgagg 
ggtggcaaac 
gtttttgtcg 
aacgcaccgc 
accgcggttg 
gttggtccag 
cgtctcgtcc 
gtagtctatc 
gcgctcgtat 
catgccgcaa 
gcatcttcca 
gaggtcggga 
gatggcatgt 
gagacctacc 
ggcggtgacc 
atcctgtccc 
gtactcttgg 
gttgacggcc 
cttccggagc 
gtatttgaag 
tttggaacgc 
aggcataaag 
ctgggcggcg 
caagaagcgc 
ttcaggggag 
gacgaatgag 
aaactggcga 
ttcccagcgg 
atctccgccg 
ccaagtatag 
gatcgggaag 
gtagaagtcc 
ctggcagcgg 
gaagcagagt 
ggctgcttgt 
gccgcgcgag 
gcgcagatgg 
ctgcaggttt 
ttccaggggc 
gactacggta 
aagcggtgac 
ggcaggggca 
gcgaacgcga 
ggcccggtga 
gcggcctggc 
atgaactgct 
gcgaggtcgt 
cagacgcggc 



ccgggctgcg 
cggtcttcgc 
tccgcggcgt 
tgcagacttt 
tccgcgccgc 
tcggggtcaa 
atgagccggt 
agaggcctgt 
gagacaaagg 
ttgtccacta 
tcaaggaagg 
ctataaaagg 
gccagctgtt 
tcagtttcca 
gtggccgcat 
gacccgtaga 
cgatcggcgc 
cattcgggaa 
tgcagggtga 
cagaggcggc 

gggggg tGt 9 

ttgcatcctt 

gggttgagtg 

atgtcgtaaa 

ccgcggatgc 

ccgaggttgc 

gagttggatg 

gcgtcacgca 

tgcacgtcta 

ttttttttcc 

atcggaaacc 

tggtaggcgc 

gaggtgtggg 

tcagtgtcgt 
ggatttggca 
ttgcgtgtga 
agcacgatct 
gggatgccct 
ctgagcccgt 
ctccacaggt 
cctatggcca 
tcccatccaa 
aacttcatga 
gtctctacat 
aactggatct 
ctgcgacggg 
tgcacgggct 
gggaatttga 
ccttgaccgt 
cccaaagtcc 
gagctgtcca 
acctcgcata 
tggttggtgg 
ccgcgcggcg 
gcgggcgagc 
cgtcggcgcc 
cgacgcggcg 
gcttgagcct 
gcaaaatctc 
cgatctcttc 
tggaaatgcg 
tgtagaccac 



cgctggccag 
cctgcgcgtc 
ggcccttggc 
tgagggcgta 
aggccccgca 
aaaccaggtt 
gtccacgctc 
cctcgagcgg 
ctcgcgtcca 
gggggtccac 
tgattggttt 
gggtgggggc 
ggggtgagta 
aaaacgagga 
ccatctggtc 
gggcgttgga 
gctccttggc 
agacggtggt 
caaggtcaac 
cgcccttgcg 
cgtccacggt 
gcaagtctag 
ggggacccca
cgtagagggg 
tggcgcgcac 
tacgggcggg 
atatggttgg 
cgaaggaggc 

gggcgcagta

acagctcgcg 
cgtcggcctc 
agcatccctt 
tgagcgcaaa 
cgcatccgcc 
gggcgaaggt 
tgcggaaggg 
cgtcaaagcc 
tgatggaagg 
gctctgaaag 
cacgggccat 
ttttttctgg 
ggttcgcggc 
ccagcatgaa 
cgtaggtgac 
cccgccacca 
ccgaacactc 
gtacatcctg 
gcccctcgcc 
ctggctgctc 
agatgtccgc 
tggtctggag 
gacgggtcag 
cggcgtcgat 
ggcggtgggc 
ccccggaggt 
gcgcgcgggc 
gttgatctcc 
gaaagagagt 
ctgcacgtct 
ctcctggaga 
ggccatgagc 
gcccccttcg 



ggtgcgcttg 
ggccaggtag 
gcgcagcttg 
gagcttgggc 
gacggtctcg 
tcccccatgc 
ggtgacgaaa 
tgttccgcgg 
ggccagcacg 
tcgctccagg 
gtaggtgtag 
gcgttcgtcc 
ctccctctga 
ggatttgata 
agaaaagaca 
cagcaacttg 
cgcgatgttt 
gcgctcgtcg 
gctggtggct 
cgagcagaat 
aaagaccccg 
cgcctgctgc 
tggcatgggg 
ctctctgagt 
gtaatcgtat 
ctgctctgct 
acgctggaag 
gtaggagtcg 
gtccagggtt 
gttgaggaca 
cgaacggtaa 
ttctacgggt 
ggtgtccctg 
ctgctcccag 
gacatcgttg 
tcccggcacc 
gttgatgttg 
caatttttta 
ggcccagtct 
tagcatttgc 
ggtgatgcag 
taggtctcgc 
gggcacgagc 
aaagagacgc 
attggaggag 
gtgctggctt 
cacgaggttg 
tggcgggttt 
gaggggagtt 
gcgcggcggt 
ctcccgcggc 
ggcgcgggct 
ggcttgcaag 
cgcgggggtg 
agggggggct 
aggagctggt 
tgaatctggc 
tcgacagaat 
cctgagttgt 
tctccgcgtc 
tgcgagaagg 
gcatcgcggg 



aggctggtcc 
catttgacca 
cccttggagg 
gcgagaaata 
cattccacga 
tttttgatgc 
aggctgtccg 
tcctcctcgt 
aaggaggcta 
gtgtgaagac 
gccacgtgac 
tcactctctt 
aaagcgggca 
ttcacctggc 
atctttttgt 
gcgatggagc 
agctgcacgt 
ggcaccaggt 
acctctccgc 
ggcggtaggg 
ggcagcaggc 
catgcgcggg 
tgggtgagcg 
attccaagat 
agttcgtgcg 
cggaagacta 
acgttgaagc 
cgcagcttgt 
tccttgatga 
aactcttcgc 
gagcctagca 
agcgcgtatg 
accatgactt 
agcaaaaagt 
aagagtatct 
tcggaacggt 
tggcccacaa 
agttcctcgt 
gcaagatgag 
aggtggtcgc 
tagaaggtaa 
gcggcagtca 
tgcttcccaa 
tcggtgcgag 
tggctattga 
ttgtaaaaac 
acctgacgac 
ggctggtggt 
acggtggatc 
cggagcttga 
gtcaggtcag 
agatccaggt
aggccgcatc 
tccttggatg 

ccggacccgc 
gctgcgcgcg 
gcctctgcgt 
caatttcggt 
cttgataggc
cggctcgctc 
cgttgaggcc 
cgcgcatgac 
cacctgcgcg

aaagaggtag 

tcgcaacgtg 

gtccacggcg 

aagacggatg 

ttcttcttct 

tgggggaggg 

gatcatctcc 

gcgcagttgg 

cggcagggat 

gagggacctg 

ccagtcacag 

gttgtttctg 

gatggtcgac 

catgccccag 

ttctaccggc 

ggcggcggcg 

gcccctcatc 

ctgcacctgc 

cgtgttgatg 

ctgcgagagc 

gcaagtccgc 

gggccagcgt 

tccgtagatg 

gtcgcggacg 

ctggccggtc 

agcgggcact 

gggttcgagc 

aacccaggtg 

ggcggctgct 

gaaagcgaaa 

agtcgcggga 

ctccccgtca 

ttgcttttcc 

agagcaagag 

ggcgacatcc 

ccggcactac 

tgagcggtac 

gaacctgttt 

cgcagggcgc 

tgagcccgac 

ggtaaccgca 

ccacgtgcgt 

ctttgtaagc 

tatagtgcag 

gcccgagggc 

gcgcagcttg 

caagttttac 

gatcgagggg 

cgtttatcgc 

cgaccgcgag 

agaggccgag 

cctggaggca 

cggcggcgtg 

agcggtgatg 

ctgcagagcc 

atcatgtcgc 

ctctccgcaa 

ctggcgatGg 

gtctacgacg 

ctggaccggc

cagggcaacc 



agattgagct 

ttgagggtgg 

gattcgttga 

aagttgaaaa 

agctcggcga 

tcaatctcct 

gggacacggc 

ccgcggcgac 

aagacgccgc 

acggcgctaa 

agcgagtccg 

tcgcaaggta 

gcggaggtgc 

agaagcacca 

gcttcgtttt 

acttcttctt 

gagtttggcc 

ggctgaagca 

gtgagggtag 

gtgtaagtgc 

tcggtgtacc 

accaggtact 

agggtggccg 

tacctggaca 

cggttccaga 

aggcgcgcgc 

cttccgtggt 

cccgtatccg 

tgcgacgtca 

gcgctagctt 

gcattaagtg 

cccccggttc 

tgcaagaccc 

cagatgcatc 

cagcggcaga 

gcggttgacg 

ctggacttgg 

ccaagggtgc 

cgcgaccgcg 

gagctgcggc 

gcgcgaaccg 

tacgagcaga 

acgcttgtgg 

gcgctggagc 

cacagcaggg 

cgctggctgc 

agcctggctg 

gcccgcaaga 

ttctacatgc 

aacgagcgca 

ctgatgcaca 

tcctactttg 

gctggggccg 

gaggaatatg 

tttctgatca 

agccgtccgg 

tgactgcgcg 

ttctggaagc 

taaacgcgct 

cgctgcttca 

tggtggggga 

tgggctccat 



ccacgtgccg 

tggcggtgtg 

tatcccccaa 

actgggagtt 

cagtgtcgcg 

cttccataag 

ggcgacgacg 

ggcgcatggt 

ccgtcatgtc 

cgatgcatct 

catcgaccgg 

ggctgagcac 

tgctgatgat 

tgtccttggg 

gacatcggcg 

ctccttcctc 

gtaggtggcg 

gggctaggtc 

actggaagtc 

agttggccat 

tgagacgcga 

ggtatcccac 

gggctccggg 

tccaggtgat 

tgttgcgcag 

aatcgttgac 

ctggtggata 

gccgtccgcc 

gacaacgggg 

ttttggccac 

gctcgctccc 

gagtctcgga 

cgcttgcaaa 

cggtgctgcg 

catgcagggc 

cggcagcaga 

aggagggcga 

agctgaagcg 

agggagagga 

atggcctgaa 

ggattagtcc 

cggtgaacca 

cgcgcgagga 

aaaacccaaa 

acaacgaggc 

tcgatttgat 

acaaggtggc 

tataccatac 

gcatggcgct 

tccacaaggc 

gcctgcaaag 

acgcgggcgc 

gacctgggct 

acgaggacga 

gatgatgcaa 

ccttaactcc 

caatcctgac 

ggtggtcccg 

ggccgaaaac 

gcgcgtggct 

tgtgcgcgag 

ggttgcacta 



ggcgaagacg 

ttctgccacg 

ggcctcaagg 

gcgcgccgac 

cacctcgcgc 

ggcctcccct 

gcgcaccggg 

ctcggtgacg 

ccggttatgg 

caacaattgt 

atcggaaaac 

cgtggcgggc 

gtaattaaag 

tccggcctgc 

caggtctttg 

ttgtcctgca 

ccctcttcct 

ggcgacaacg 

atccatgtcc 

aacggaccag 

gtaagccctc 

caaaaagtgc 

ggcgagatct 

gccggcggcg 

cggcaaaaag 

gctctagacc 

aattcgcaag 

gtgatccatg 

gagtgctcct 

tggccgcgcg 

tgtagccgga 

ccggccggac 

ttcctccgga 

gcagatgcgc 

accctcccct 

tggtgattac 

gggcctggcg 

tgatacgcgt 

gcccgaggag 

tcgcgagcgg 

cgcgcgcgca 

ggagattaac 

ggtggctata 

tagcaagccg 

attcagggat 

aaacatcctg 

cgccatcaac 

cccttacgtt 

gaaggtgctt 

cgtgagcgtg 

ggccctggct 

tgacctgcgc 

ggcggtggca 

tgagtacgag 

gacgcaacgg 

acggacgact 

gcgttccggc 

gcgcgcgcaa 

agggccatcc 

cgttacaaca 

gccgtggcgc 

aacgccttcc 



gcgtagtttc 
aagaagtaca 
cgctccatgg 
acggttaact 
tcaaaggcta 
tcttcttctt 
aggcggtcga 
gcgcggccgt 
gttggcgggg 
tgtgtaggta 
ctctcgagaa 
ggcagcgggc 
taggcggtct 
tgaatgcgca 
tagtagtctt 
tctcttgcat 
cccatgcgtg 
cgctcggcta 
acaaagcggt 
ttaacggtct 
gagtcaaata 
ggcggcggct 
tccaacataa 
gtggtggagg 
tgctccatgg 
gtgcaaaagg 
ggtatcatgg 
cggttaccgc 
tttggcttcc 
cagcgtaagc 
gggttatttt 
tgcggcgaac 
aacagggacg 
ccccctcctc 
cctcctaccg 
gaacccccgc 
cggctaggag 
gaggcgtacg 
atgcgggatc 
ttgctgcgcg 
cacgtggcgg 
tttcaaaaaa 
ggactgatgc 
ctcatggcgc 
gcgctgctaa 
cagagcatag 
tattccatgc 
cccatagaca 
accttgagcg 
agccggcggc 
ggcacgggca 
tgggccccaa 
cccgcgcgcg 
ccagaggacg 
acccggcggt 
ggcgccaggt 
agcagccgca 
accccacgca 
ggcccgacga 
gcggcaacgt 
agcgtgagcg 
tgagtacaca 



gcaggcgctg 

taacccagcg 

cctcgtagaa 

cctcctccag 

caggggccrtc 

ctggcggcgg 

caaagcgctc 

tctcgcgggg 

ggctgccatg 

ctccgccgcc 

aggcgtctaa 

ggcggtcggg 

tgagacggcg 

ggcggtcggc 

gcatgagcct 

ctatcgctgc 

tgaccccgaa 

atatggcctg 

ggtatgcgcc 

ggtgacccgg 

cgtagtcgtt 

ggcggtagag 

ggcgatgata 

cgcgcggaaa 

tcgggacgct 

agagcctgta 

cggacgaccg 

ccgcgtgtcg 

ttccaggcgc 

ggttaggctg 

ccaagggttg 

gggggtttgc 

agcccctttt 

agcagcggca 

cgtcaggagg 

ggcgccgggc 

cgccctctcc 

tgccgcggca 

gaaagttcca 

aggaggactt 

ccgccgacct 

gctttaacaa 

atctgtggga 

agctgttcct 

acatagtaga 

tggtgcagga 

ttagcctggg 

aggaggtaaa 

acgacctggg 

gcgagctcag 

gcggcgatag 

gccgacgcgc 

ctggcaacgt 

gcgagtacta 

gcgggcggcg 

catggaccgc 

ggccaaccgg 

cgagaaggtg 

ggccggcctg 

gcagaccaac 

cgcgcagcag 

gcccgccaac 



10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 




WO 01/83729 PCT/EP01/04863



gtgccgcggg 
gagacaccgc 
caaggcctgc 
gtgcgggctc 
ctgttgctgc 
ctaggtcact 
actttccagg 
gaggcaaccc 
ttaaacagcg 
atgcgcgacg 
ggcatgtatg 
gccgccgtga 
cctggtttct 
gacatagacg 
gagcaggcag 
ctaggcgctg 
cttaccagca 
tcgctgctgc 
gagagcctag 
ccaggcccgc 
gaggacgatg 
tttgcgcacc 
aataaaaaac 
tgcggcgcgc 
cggcgccagt 
tgcctccgcg 
cacccctatt 
ccctgaacta 
acagcccggg 
gcgacctgaa 
ataagtttaa 
tgaaatacga 
tagaccttat 
ttctggaaag 
ccgtcactgg 
ttttgctgcc 
tccgcaagcg 
gtaacattcc 
aacagggcgg 
ccaacgcggc 
gcgacacctt 
ctgccgcccc 
ccctgacaga 
cccagtaccg 
catggaccct 
tgccagacat 
cggtggtggg 
tctactccca 
agaaccagat 
ctgctctcac 
tgaccattac 
tctcgccgcg 
ccagcaataa 
agcgctccga 
acaaacgcgg 
aggcgcgcaa 
ccgtggtgcg 
gtcgccaccg 
gcgcacgtcg 
ttgtcactgt 
gtgctatgac 
tgcgcgtgcc 



gacaggagga 
aaagtgaggt 
agaccgtaaa 
ccacaggcga 
tgctaatagc 
tgctgacact 
agattacaag 
taaactacct 
aggaggagcg 
gggtaacgcc 
cctcaaaccg 
accccgagta 
acaccggggg 
acagcgtgtt 
aggcggcgct 
cggccccgcg 
ctcgcaccac 
agccgcagcg 
tggacaagat 
gcccgcccac 
actcggcaga 
ttcgccccag 
tcaccaaggc 
ggcgatgtat 
ggcggcggcg 
gtacctgcgg 
cgacaccacc 
ccagaacgac 
ggaggcaagc 
aaccatcctg 
ggcgcgggtg 
gtgggtggag 
gaacaacgcg 
cgacatcggg 
tcttgtcatg 
aggatgcggg 
gcaacccttc 
cgcactgttg 
gggtggcgca 
agccgcggca 
tgccacacgg 
cgctgcgcaa 
ggacagcaag 
cagctggtac 
gctttgcact 
gatgcaagac 
cgccgagctg 
actcatccgc 
tttggcgcgc 
agatcacggg 
tgacgccaga 
cgtcctatcg 
cacaggctgg 
ccaacaccca 
ccgcactggg 
ctacacgccc 
cggagcccgg 
ccgccgaccc 
caccggccga 
gccccccagg 
tcagggtcgc 
cgtgcgcacc 



ctacaccaac 
gtaccagtct 
cctgagccag 
ccgcgcgacc 
gcccttcacg 
gtaccgcgag 
tgtcagccgc 
gctgaccaac 
cattttgcgc 
cagcgtggcg 
gccgtttatc 
tttcaccaat 
attcgaggtg 
ttccccgcaa 
gcgaaaggaa 
gtcagatgct 
ccgcccgcgc 
cgaaaaaaac 
gagtagatgg 
ccgtcgtcaa 
cgacagcagc 
gctggggaga 
catggcaccg 
gaggaaggtc 
ctgggttctc 
cctaccgggg 
cgtgtgtacc 
cacagcaact 
acacagacca 
cataccaaca 
atggtgtcgc 
ttcacgctgc 
atcgtggagc 
gtaaagtttg 
cctggggtat 
gtggacttca 
caggagggct 
gatgtggacg 
ggcggcagca 
atgcagccgg 
gctgaggaga 
cccgaggtcg 
aaacgcagtt 
cttgcataca 
cctgacgtaa 
cccgtgacct 
ttgcccgtgc 
cagtttacct 
ccgccagccc 
acgctaccgc 
cgccgcacct 
agccgcactt 
ggcctgcgct 
gtgcgcgtgc 
cgcaccaccg 
acgccgccac 
cgctatgcta 
ggcactgccg 
cgggcggcca 
tccaggcgac 
aggggcaacg 
cgccccccgc 



tttgtgagcg 
gggccagact 
gctttcaaaa 
gtgtctagct 
gacagtggca 
gccataggtc 
gcgctggggc 
cggcggcaga 
tacgtgcagc 
ctggacatga 
aaccgcctaa 
gccatcttga 
cccgagggta 
ccgcagaccc 
agcttccgca 
agtagcccat 
ctgctgggcg 
ctgcctccgg 
aagacgtacg 
aggcacgacc 
gtcctggatt 
atgttttaaa 
agcgttggtt 
ctcctccctc 
ccttcgatgc 
ggagaaacag 
tggtggacaa 
ttctgaccac 
tcaatcttga 
tgccaaatgt 
gcttgcctac 
ccgagggcaa 
actacttgaa 
acacccgcaa 
atacaaacga 
cccacagccg 
ttaggatcac 
cctaccaggc 
acagcagtgg 
tggaggacat 
agcgcgctga 
agaagcctca 
acaacctaat 
actacggcga 
cctgcggctc 
tccgctccac 
actccaagag 
ctctgaccca 
ccaccatcac 
tgcgcaacag 
gcccctacgt 
tttgagcaag 
tcccaagcaa 
gcgggcacta 
tcgatgacgc 
cagtgtccac 
aaatgaagag 
cccaacgcgc 
tgcgggccgc 
gagcggccgc 
tgtattgggt 
gcaactagat 



cactgcggct 
attttttcca 
acttgcaggg 
tgctgacgcc 
gcgtgtcccg 
aggcgcatgt 
aggaggacac 
agatcccctc 
agagcgtgag 
ccgcgcgcaa 
tggactactt 
acccgcactg 
acgatggatt 
tgctagagtt 
ggccaagcag 
ttccaagctt 
aggaggagta 
catttcccaa 
cgcaggagca 
gtcagcgggg 
tgggagggag 
aaaaaaaaag 
ttcttgtatt 
ctacgagagt 
tcccctggac 
catccgttac 
caagtcaacg 
ggtcattcaa 
cgaccggtcg 
gaacgagttc 
taaggacaat 
ctactccgag 
agtgggcaga 
cttcagactg 
agccttccat 
cctgagcaac 
ctacgatgat 
gagcttgaaa 
cagcggcgcg 
gaacgatcat 
ggccgaagca 
gaagaaaccg 
aagcaatgac 
ccctcagacc 
ggagcaggtc 
gcgccagatc 
cttctacaac 
cgtgttcaat 
caccgtcagt 
catcggagga 
ttacaaggcc 
catgtccatc 
gatgtttggc 
ccgcgcgccc 
catcgacgcg 
agtggacgcg 
acggcggagg 
ggcggcggcc 
tcgaaggctg 
cgcagcagcc 
gcgcgactcg 
tgcaagaaaa 



aatggtgact 
gaccagtaga 
gctgtggggg 
caactcgcgc 
ggacacatac 
ggacgagcat 
gggcagcctg 
gttgcacagt 
ccttaacctg 
catggaaccg 
gcatcgcgcg 
gctaccgccc 
cctctgggac 
gcaacagcgc 
cttgtccgat 
gatagggtct 
cctaaacaac 
caacgggata 
cagggacgtg 
tctggtgtgg 
tggcaacccg 
catgatgcaa 
ccccttagta 
gtggtgagcg 
ccgccgtttg 
tctgagttgg 
gatgtggcat 
aacaatgact 
cactggggcg 
atgtttacca 
caggtggagc 
accatgacca 
cagaacgggg 
gggtttgacc 
ccagacatca 
ttgttgggca 
ctggagggtg 
gatgacaccg 
gaagagaact 
gccattcgcg 
gcggccgaag 
gtgatcaaac 
agcaccttca 
ggaatccgct 
tactggtcgt 
agcaactttc 
gaccaggccg 
cgctttcccg 
gaaaacgttc 
gtccagcgag 
ctgggcatag 
cttatatcgc 
ggggccaaga
tggggcgcgc 
gtggtggagg 
gccattcaga 
cgcgtagcac 
ctgcttaacc 
gccgcgggta 
gcggccatta 
gttagcggcc 
aactacttag 
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actcgtactg 
aaatcaaaga 
aagagcagga 
atgatgaact 
agtggaaagg 
ccggtgagcg 
acctgcttga 
acatgctggc 
tgcagcaggt 
ctggtgactt 
tcttggaaaa 
agcaggtggc 
gcaccagtat 
cggtggcgga 
tgcaaacgga 
agtacggcgc 
cccccggcta 
ccaccactgg 
tgcgcagggt 
ccagcatcgt 
tccgtttccc 
acggcctgac 
gcatgcgcgg 
tgcccggaat 
tgtggaaaaa 
tgtagaatgg 
atgggaaact 
tcgctgtgga 
tggaacagca 
aaggtggtag 
gtgcaaaata 
gccgtggaga 
gaaactctgg 
ctgcccacca 
gtaacgctgg 
accgccgttg 
cgatcgttgc 
ctgggggtgc 
tgtcatgtat 
ccaagatggc 
acgcctcgga 
tcagcctgaa 
accggtccca 
cgtacaaggc 
cgtactttga 
ctgcctacaa 
ctactgctct 
agcaagctga 
ttacaaagga 
catttcaacc 
cagctgggag 
aacccacaaa 
aaagtcaagt 
acttgactcc 
atatttctta 
ctatgcccaa 
acaacagcac 
tagatttgca 
atagaaccag 
gaattattga 
gtgtgattaa 
gggaaaaaga 



ttgtatgtat 
agagatgctc 
ttacaagccc 
tgacgacgag 
tcgacgcgta 
ctccacccgc 
gcaggccaac 
gttgccgctg 
gctgcccgcg 
ggcacccacc 
aatgaccgtg 
gccgggactg 
tgccaccgcc 
tgccgcggtg 
cccgtggatg 
cgccagcgcg 
tcgtggctac 
aacccgccgc 
ggctcgcgaa 
ttaaaagccg 
ggtgccggga 
gggcggcatg 
cggtatcctg 
tgcatccgtg 
tcaaaataaa 
aagacatcaa 
ggcaagatat 
gcggcattaa 
gcacaggcca 
atggcctggc 
agattaacag 
cagtgtctcc 
tgacgcaaat 
cccgtcccat 
acctgcctcc 
ttgtaacccg 
ggcccgtagc 
aatccctgaa 
gcgtccatgt 
taccccttcg 
gtacctgagc 
taacaagttt 
gcgtttgacg 
gcggttcacc 
catccgcggc 
cgccctggct 
tgaaataaac 
gcagcaaaaa 
gggtattcaa 
tgaacctcaa 
agtccttaaa 
tgaaaatgga 
ggaaatgcaa 
taaagtggta 
catgcccact 
caggcctaat 
gggtaatatg 
agacagaaac 
gtacttttct 
aaatcatgga 
tacagagact 
tgctacagaa 



ccagcggcgg 
caggtcatcg 
cgaaagctaa 
gtggaactgc 
aaacgtgttt
acctacaagc 
gagcgcctcg 
gacgagggca 
cttgcaccgt 
gtgcagctga 
gaacctgggc 
ggcgtgcaga 
acagagggca 
caggcggtcg 
tttcgcgttt 
ctactgcccg 
acctaccgcc 
cgccgtcgcc 
ggaggcagga 
gtctttgtgg 
ttccgaggaa 
cgtcgtgcgc 
cccctcctta 
gccttgcagg 
aagtctggac 
ctttgcgtct 
cggcaccagc 
aaatttcggt 
gatgctgagg 
ctctggcatt 
taagcttgat 
agaggggcgt 
agacgagcct 
cgcgcccatg 
ccccgccgac 
tcctagccgc 
cagtggcaac 
gcgccgacga 
cgccgccaga 
atgatgccgc
cccgggctgg 
agaaacccca
ctgcggttca 
ctagctgtgg 
gtgctggaca 
cccaagggtg 
ctagaagaag 
actcacgtat 
ataggtgtcg 
ataggagaat 
aagactaccc 
gggcaaggca 
tttttctcaa 
ttgtacagtg 
attaaggaag 
tacattgctt 
ggtgttctgg 
acagagcttt 
atgtggaatc 
actgaagatg 
cttaccaagg 
ttttcagata 



cggcgcgcaa 
cgccggagat 
agcgggtcaa 
tgcacgctac 
tgcgacccgg 
gcgtgtatga 
gggagtttgc 
acccaacacc 
ccgaagaaaa 
tggtacccaa 
tggagcccga 
ccgtggacgt 
tggagacaca 
ctgcggccgc 
cagccccccg 
aatatgccct 
ccagaagacg 
gtcgccagcc 
ccctggtgct 
ttcttgcaga 
gaatgcaccg 
accaccggcg 
ttccactgat 
cgcagagaca 
tctcacgctc 
ctggccccgc 
aatatgagcg 
tccaccgtta 
gataagttga 
agcggggtgg 
ccccgccctc 
ggcgaaaagc 
ccctcgtacg 
gctaccggag 
acccagcaga 
gcgtccctgc 
tggcaaagca 
tgcttctgaa 
ggagctgctg 
agtggtctta
tgcagtttgc 
cggtggcgcc 
tccctgtgga 
gtgataaccg 

ggggccctac

ccccaaatcc 
aggacgatga 
ttgggcaggc 
aaggtcaaac 
ctcagtggta 
caatgaaacc 
ttcttgtaaa 
ctactgaggc 
aagatgtaga 
gtaactcacg 
ttagggacaa 
cgggccaagc 
cataccagct 
aggctgttga 
aacttccaaa 
taaaacctaa 
aaaatgaaat 



cgaagctatg 
ctatggcccc 
aaagaaaaag 
cgcgcccagg 
caccaccgta 
tgaggtgtac 
ctacggaaag 
tagcctaaag 
gcgcggccta 
gcgccagcga 
ggtccgcgtg 
tcagataccc 
aacgtccccg 
gtccaagacc 
gcgcccgcgc 
acatccttcc 
agcaactacc 
cgtgctggcc 
gccaacagcg 
tatggccctc 
taggaggggc 
gcggcgcgcg 
cgccgcggcg 
ctgattaaaa 
gcttggtcct 
gacacggctc 
gtggcgcctt 
agaactatgg 
aagagcaaaa 
tggacctggc 
ccgtagagga 
gtccgcgccc 
aggaggcact 
tgctgggcca 
aacctgtgct 
gccgcgccgc 
cactgaacag 
tagctaacgt 
agccgccgcg 
catgcacatc 
ccgcgccacc 
tacgcacgac 
ccgtgaggat 
tgtgctggac 
ttttaagccc 
ttgcgaatgg 
caacgaagac 
gccttattct 
acctaaatat 
cgaaactgaa 
atgttacggt 
gcaacaaaat 
gaccgcaggc 
tatagaaacc 
agaactaatg 
ttttattggt 
atcgcagttg 
tttgcttgat 
cagctatgat 
ttactgcttt 
aacaggtcag 
aagagttgga 



tccaagcgca 17700 
ccgaagaagg 17760 
aaagatgatg 17820 
cgacgggtac 17880 
gtctttacgc 17940 
ggcgacgagg 18000 
cggcataagg 18060 
cccgtaacac 18120 
aagcgcgagt 18180 
ctggaagatg 18240 
cggccaatca 18300 
actaccagta 18360 
gttgcctcag 18420 
tctacggagg 18480 
ggttcgagga 18540 
attgcgccta 18600 
cgacgccgaa 18660
ccgatttccg 18720 
cgctaccacc 18780 
acctgccgcc 18840 
atggccggcc 18900 
tcgcaccgtc 18960 
attggcgccg 19020 
acaagttgca 19080 
gtaactattt 19140 
gcgcccgttc 19200 
cagctggggc 19260 
cagcaaggcc 19320 
tttccaacaa 19380 
caaccaggca 19440 
gcctccaccg 19500 
cgacagggaa 19560 
aaagcaaggc 19620 
gcacacaccc 19680 
gccaggcccg 19740 
cagcggtccg 19800 
catcgtgggt 19860 
gtcgtatgtg 19920 
cgcccgcttt 19980 
tcgggccagg 20040 
gagacgtact 20100 
gtgaccacag 20160 
actgcgtact 20220 
atggcttcca 20280
tactctggca 20340 
gatgaagctg 20400 
gaagtagacg 20460 
ggtataaata 20520 
gccgataaaa 20580 
attaatcatg 20640 
tcatatgcaa 20700 
ggaaagctag 20760 
aatggtgata 20820 
ccagacactc 20880 
ggccaacaat 20940 
ctaatgtatt 21000 
aatgctgttg 21060 
tccattggtg 21120 
ccagatgtta 21180 
ccactgggag 21240 
gaaaatggat 21300 
aataattttg 21360 
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ccatggaaat 
tgtatttgcc 
acacctacga 
accttggagc 
gcaatgctgg 
acatccaggt 
acacctacga 
atgacctaag 
tcttccccat 
acgaccagtc 
acgctaccaa 
ccttcacgcg 
acacctactc 
agaaggtggc 
cccccaacga 
gtaacatgac 
agggcttcta 
agcccatgag 
tcctacacca 
gacaggccta 
ttacccagaa 
ttatgtccat 
acgcgctaga 
tgtttgaagt 
tgtacctgcg 
aacaacagct 
tggttgtggg 
acacaagctc 
gatggccttt 
ttctgaccag 
cgccattgct 
ggggcccaac 
ctggccccaa 
ctccatgctc 
cagcttcctg 
cacttctttt 
aggcaaatgc 
cgccgtttaa 
gttgcgatac 
ggtgaagttt 
cgatatcttg 
gttgcagcac 
ggagatcaga 
tagctgcctt 
catcaaaagg 
gatctgctta 
gccggaaaac 
ggagatctgc 
ctccttcagc 
atttatcata 
cagccacaac 
caggtacgcc 
cagctgcaac 
cacttggtca 
catcagcgcg
cgggttcatc 
ccgcatacca 
tttgccatgc 
ttctctttct 
agaagggcgc 
ccgcgggctg 
ctcgatacgc 



caatctaaat 
cgacaagcta 
ctacatgaac 
acgctggtcc 
cctgcgctac 
gcctcagaag 
gtggaacttc 
ggttgacgga 
ggcccacaac 
ctttaacgac 
cgtgcccata 
ccttaagact 
tggctctata 
cattaccttt 
gtttgaaatt 
caaagactgg 
tatcccagag 
ccgtcaggtg 
acacaacaac 
ccctgctaac 
aaagtttctt 
gggcgcactc 
catgactttt 
ctttgacgtg 
cacgcccttc 
gccgccatgg 
ccatattttt 
gcctgcgcca 
gcctggaacc 
cgactcaagc 
tcttcccccg 
tcggccgcct 
actcccatgg 
aacagtcccc 
gagcgccact 
tgtcacttga 
ttttatttgt 
aaatcaaagg 
tggtgtttag 
tcactccaca 
aagtcgcagt 
tggaacacta 
tccgcgtcca 
cccaaaaagg 
tgaccgtgcc 
aaagccacct 
tgattggccg 
accacatttc 
gcgcgctgcc 
atgcttccgt 
gcgcagcccg 
tgcaggaatc 
ccgcggtgct 
ggcagtagtt 
cgcgcagcct 
accgtaattt 
cgcgccactg 
ttgattagca 
tcctcgctgt 
ttctttttct 
ggtgtgcgcg 
cgcctcatcc 



gccaacctgt 
aagtacagtc 
aagcgagtgg 
cttgactata 
cgctcaatgt 
ttctttgcca 
aggaaggatg 
gccagcatta 
accgcctcca 
tatctctccg 
tccatcccct 
aaggaaaccc 
ccctacctag 
gactcttctg 
aagcgctcag 
ttcctggtac 
agctacaagg 
gtggatgata 
tctggatttg 
ttcccctatc 
tgcgatcgca 
acagacctgg 
gaggtggatc 
gtccgtgtgc 
tcggccggca 
gctccagtga 
tgggcaccta 
tagtcaatac 
cgcactcaaa 
aggtttacca 
accgctgtat 
gtggactatt 
atcacaaccc 
aggtacagcc 
cgccctactt 
aaaacatgta 
acactctcgg 
ggttctgccg 
tgctccactt 
ggctgcgcac 
tggggcctcc 
tcagcgccgg 
ggtcctccgc 
gcgcgtgccc 
cggtctgggc 
gagcctttgc 
gacaggccgc 
ggccccaccg 
cgttttcgct 
gtagacactt 
tgggctcgtg 
gccccatcat 
cctcgttcag 
tgaagttcgc 
ccatgccctt 
cactttccgc 
ggtcgtcttc 
ccggtgggtt 
ccacgattac 
tcttgggcgc 
gcaccagcgc 
gcttttttgg 



ggagaaattt 
cttccaacgt 
tggctcccgg 
tggacaacgt 
tgctgggcaa 
ttaaaaacct 
ttaacatggt 
agtttgatag 
cgcttgaggc 
ccgccaacat 
cccgcaactg 
catcactggg 
atggaacctt 
tcagctggcc 
ttgacgggga 
aaatgctagc 
accgcatgta 
ctaaatacaa 
ttggctacct 
cgcttatagg 
ccctttggcg 
gccaaaacct 
ccatggacga 
accggccgca 
acgccacaac 
gcaggaactg 
tgacaagcgc 
ggccggtcgc 
aacatgctac 
gtttgagtac 
aacgctggaa 
ctgctgcatg 
caccatgaac 
caccctgcgt 
ccgcagccac 
aaaataatgt 
gtgattattt 
cgcatcgcta 
aaactcaggc 
catcaccaac 
gccctgcgcg 
gtggtgcacg 
gttgctcagg 
aggctttgag 
gttaggatac 
gccttcagag 
gtcgtgcacg 
gttcttcacg 
cgtcacatcc 
aagctcgcct 
atgcttgtag 
cgtcacaaag 
ccaggtcttg 
ctttagatcg 
ctcccacgca 
ttcgctgggc 
attcagccgc 
gctgaaaccc 
ctctggtgat 
aatggccaaa 
gtcttgtgat 
gggcgcccgg 



cctgtactcc 
aaaaatttct
gttagtggac 
caacccattt 
tggtcgctat 
ccttctcctg 
tctgcagagc 
catttgcctt 
catgcttaga 
gctctaccct 
ggcggctttc 
ctcgggctac 
ttacctcaac 
tggcaatgac 
gggttacaac 
taactacaac 
ctccttcttt 
ggactaccaa 
tgcccccacc 
caagaccgca 
catcccattc 
tctctacgcc 
gcccaccctt 
ccgcggcgtc 
ataaagaagc 
aaagccattg 
tttccaggct 
gagactgggg 
ctctttgagc 
gagtcactcc 
aagtccaccc 
tttctccacg 
cttattaccg 
cgcaaccagg 
agtgcgcaga 
actagagaca 
acccccaccc 
tgcgccactg 
acaaccatcc 
gcgtttagca 
cgcgagttgc 
ctggccagca 
gcgaacggag 
ttgcactcgc 
agcgcctgca 
aagaacatgc 
cagcaccttg 
atcttggcct 
atttcaatca 
tcgatctcag 
gtcacctctg 
gtcttgttgc 
catacggccg 
ttatccacgt 
gacacgatcg 
tcttcctctt 
cgcactgtgc 
accatttgta 
ggcgggcgct 
tccgccgccg 
gagtcttcct 
ggaggcggcg 



aacatagcgc 
gataacccaa 
tgctacatta 
aaccaccacc 
gtgcccttcc 
ccgggctcat 
tccctaggaa 
tacgccacct 
aacgacacca 
atacccgcca 
cgcggctggg 

gacccttatt 
cacaccttta 
cgcctgctta 
gttgcccagt 
attggctacc 
agaaacttcc 
caggtgggca 
atgcgcgaag 
gttgacagca 
tccagtaact 
aactccgccc 
ctttatgttt 
atcgaaaccg 
aagcaacatc 
tcaaagatct 
ttgtttctcc 
gcgtacactg 
cctttggctt 
tgcgccgtag 
aaagcgtaca 
cctttgccaa 
gggtacccaa 
aacagctcta 
ttaggagcgc 
ctttcaataa 
ttgccgtctg 
gcagggacac 
gcggcagctc 
ggtcgggcgc 
gatacacagg 
cgctcttgtc 
tcaactttgg 
accgtagtgg 
taaaagcctt 
cgcaagactt 
cgtcggtgtt 
tgctagactg 
cgtgctcctt 
cgcagcggtg 
caaacgactg 
tggtgaaggt 
ccagagcttc 
ggtacttgtc 
gcacactcag 
cctcttgcgt 
gcttacctcc 
gcgccacatc 
cgggcttggg
aggtcgatgg 
cgtcctcgga 
gcgacgggga 



21420 
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21540 
21600 
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cggggacgac 

ggtggtttcg 

gatcatggag 

cgcctccacc 

ggaggaggaa 

ctcagtacca 

agtcgggcgg 

gaagcatctg 

gcccctcgcc 

accccccaaa 

cgtatttgcc 

acccctatcc 

cgctgtcata 

acgcgacgag 

ctctggagtg 

catcgaggtc 

agtcatgagt 

agaacaaaca 

aacgcgcgag 

taccgtggag 

agaggaaaca 

caacgtggag 

gcaaaacgtg 

cgtttactta 

ggaggagtgc 

gacggccttc 

cctgcttaaa 

ctttaggaac 

tagcgacttt 

ccttctgcag 

tgacggtcta 

ttgcaattcg 

ctcgcctgac 

ggcttacctt 

agaccaatcc 

tcttggccaa 

gggggtttac 

gccctatcag 

agctgccgcc 

tggacgagga 

aggtcgaaga 

cccagaaatc 

cactgcccgt 

ccaagcagcc 

gcgggcacaa 

cccgccgctt 

accgtcatct 

gccacacaga 

gcggcggcag 

cgcgagctta 

caagaacaag 

tatcacaaaa 

aaatactgcg 

a?* » r* t* a r*n+" n 



acgtcctcca 

cgctgctcct 

tcagtcgaga 

gatgccgcca 

gtgattatcg 

acagaggata 

ggggacgaaa 

cagcgccagt 

atagcggatg 

cgccaagaaa 

gtgccagagg 

tgccgtgcca 

cctgatatcg 

aagcgcgcgg 

ttggtggaac 

acccactttg 

gagctgatcg 

gaggagggcc 

cctgccgact 

cttgagtgca 

ttgcactaca 

ctctgcaacc 

cttcattcca 

tttctatgct 

aacctcaagg 

aacgagcgct 

accctgcaac 

tttatcctag 

gtgcccatta 

ctagccaact 

ctggagtgtc 

cagctgctta 

gaaaagtccg 

cgcaaatttg 

cgcccgccaa 

ttgcaagcca 

ttggaccccc 

cagcagccgc 

gccacccacg 

ggaggaggac 

ggtgtcagac 

ggcaaccggt 

tcgccgaccc 

gccgccgtta 

gaacgccata 

tcttctctac 

ctacagccca 

agcaaaggcg 

cagcaggagg 

gaaacaggat 

agctgaaaat 

gcgaagatca 

cgctgactct 



tggttggggg 

cttcccgact 

agaaggacag 

acgcgcctac 

agcaggaccc 

aaaagcaaga 

ggcatggcga 

gcgccattat 

tcagccttgc 

acggcacatg 

tgcttgccac 

accgcagccg 

cctcgctcaa 

caaacgctct 

tcgagggtga 

cctacccggc 

tgcgccgtgc 

tacccgcagt 

tggaggagcg 

tgcagcggtt 

cctttcgaca 

tggtctccta 

cgctcaaggg 

acacctggca 

agctgcagaa 

ccgtggccgc 

agggtctgcc 

agcgctcagg 

agtaccgcga 

accttgccta 

actgtcgctg 

acgaaagtca 

cggctccggg 

tacctgagga 

atgcggagct 

tcaacaaagc 

agtccggcga 

gggcccttgc 

gacgaggagg 

atgatggaag 

gaaacaccgt 

tccagcatgg 

aaccgtagat 

gcccaagagc 

gttgcttgct 

catcacggcg 

tactgcaccg 

accggatagc 

aggagcgctg 

ttttcccact 

aaaaaacagg 

gcttcggcgc 



taaggactag 



at’r'.lmr'.ann.rt cinr^a namr'ci 



acgtcgcgcc 

ggccatttcc 

cctaaccgcc 

caccttcccc 

aggttttgta 

ccaggacaac 

ctacctagat 

ctgcgacgcg 

ctacgaacgc 

cgagcccaac 

ctatcacatc 

agcggacaag 

cgaagtgcca 

gcaacaggaa 

caacgcgcgc 

acttaaccta 

gcagcccctg 

tggcgacgag 

acgcaaacta 

ctttgctgac 

gggctacgta 

ccttggaatt 

cgaggcgcgc 

gacggccatg 

actgctaaag 

gcacctggcg 

agacttcacc 

aatcttgccc 

atgccctccg 

ccactctgac 

caacctatgc 

aattatcggt 

gttgaaactc 

ctaccacgcc 

taccgcctgc 

ccgccaagag 

ggagctcaac 

ttcccaggat 

aatactggga 

actgggagag 

caccctcggt 

ctacaacctc 

gggacaccac 

aacaacagcg 

tgcaagactg 

tggccttccc 

gcggcagcgg 

aagactctga 

cgtctggcgc 

ctgtatgcta 

tctctgcgat 

acgctggaag 

tttcgcgccc 

nrnrrprrr-pp 



gcaccgcgtc 

ttctcctata 

ccctctgagt 

gtcgaggcac 

agcgaagacg 

gcagaggcaa 

gtgggagacg 

ttgcaagagc 

cacctattct 

ccgcgcctca 

tttttccaaa 

cagctggcct 

aaaatctttg 

aacagcgaaa 

ctagccgtac 

ccccccaagg 

gagagggatg 

cagctagcgc 

atgatggccg 

ccggagatgc 

cgccaggcct 

ttgcacgaaa 

cgcgactacg 

ggcgtttggc 

caaaacttga 

gacatcattt 

agtcaaagca 

gccacctgct 

ccgctttggg 

ataatggaag 

accccgcacc 

acctttgagc 

actccggggc 

cacgagatta 

gtcattaccc 

tttctgctac 

ccaatccccc 

ggcacccaaa 

cagtcaggca 

cctagacgag 

cgcattcccc 

cgctcctcag 

tggaaccagg 

ccaaggctac 

tgggggcaac 

ccgtaacatc 

cagcggcagc 

caaagcccaa 

ccaacgaacc 

tatttcaaca 

ccctcacccg 

acgcggaggc 

tttctcaaat 

nt-nt* r*rrt- r>nrr 



cgcgctcggg 

ggcagaaaaa 

tcgccaccac 

ccccgcttga 

acgaggaccg 

acgaggaaca 

acgtgctgtt 

gcagcgatgt 

caccgcgcgt 

acttctaccc 

actgcaagat 

tgcggcaggg 

agggtcttgg 

atgaaagtca 

taaaacgcag 

tcatgagcac 

caaatttgca 

gctggcttca 

cagtgctcgt 

agcgcaagct 

gcaagatctc 

accgccttgg 

tccgcgactg 

agcagtgctt 

aggacctatg 

tccccgaacg 

tgttgcagaa 

gtgcacttcc 

gccactgcta 

acgtgagcgg 

gctccctggt 

tgcagggtcc 

tgtggacgtc 

ggttctacga 

agggccacat

gaaagggacg 

cgccgccgca 

aagaagctgc 

gaggaggttt 

gaagcttccg 

tcgccggcgc 

gcgccgccgg 

gccggtaagt 

cgctcatggc 

atctccttcg 

ctgcattact 

aacagcagcg 

gaaatccaca 

cgtatcgacc 

gagcaggggc 

cagctgcctg 

tctcttcagt 

ttaagcgcga 

r*rrr*r»» t- a t* ct 
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25200 

25260 

25320 

25380 
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25980 

26040 
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26460 
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26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 
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27300 
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27420 

27480 

27540 

27600 

27660 
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ttcacgcctc 
ggcattggaa 
tcgggacctc 
tcggcggacg 
ctggtccact 
gaattgcccg 
gagcttgccc 
aggggaccct 
ttaattaatt 
ttgtgttatg 
tcagtagtat 
acagaaccct 
ctccccggct 
tattccacac 
gcagctcact 
gcggttgctt 
cgtgcatcag 
gctccgtcct 
gcagcataag 
cacagtaact 
atccaaagct 
agattaagtg 
tgtaattcac 
ccatcctaaa 
tggaacaatg 
tatcaatgtt 
cccgcgttag 
tgcagggaag 
gcagcggatg 
ccctactgta 
atggaacgcc 
gatctgcgtc 
tctctcaaag 
gctgccctga 
ttctgcgagt
ttccaaaaga 
ggtggcgtgg 
aatggcttcc 
agggtgaatc 
tcgccacctt 
aatctgctcc 
tcaggttcct 
tcccgtaggt 
gcggccactt 
ctcggagcta 
aaaatgcaag 
gtagtcatgc 
catttttctc 
aacatttaaa 
cggactacgg 
accgacagct 
tgattcatcg 
gcgtagagac 
cacataaaca 
aacatacagc 
tattaaaaaa 
caagtgcaga 
aacacccaga 
ttcctcaaat 
caattcccaa 
cgccccgcgc 
aaggtatatt 



gtcaggcaat 
ctctgcaatt 
ccggccacta 
gctacgactg 
gtcgccgcca 
aggatcatat 
gtagcctgat 
gtgttctcac 
gccacatcct 
tttcaacgtg 
agccccacca 
agtattcaac 
ggccttaaaa 
ggtttcctgt 
taagttcatg 
aacgggcggc 
gatagggcgg 
gcaggaatac 
gcgccttgtc 
gcagcacagc 
catggcgggg 
gcgacccctc 
cacctcccgg 
ccagctggcc 
acagtggaga 
ggcacaacac 
aaccatatcc 
acctcgcacg 
atcctccagt 
cggagtgcgc 
ggacgtagtc 
tccggtctcg 
catccaggcg 
taacatccac 
cacacacggg 
ttatccaaaa 
tcaaactcta 
aaaaggcaaa 
tcctctataa 
ctcaatatat 
agagcgccct 
cacagacctg 
cccttcgcag 
ccccgccagg 
tgctaaccag 
gtgctgctca 
tcatgcagat 
tcaaacatgt 
cattagaagc 
ccatgccggc 
cctcggtcat 
gtcagtgcta 
aacattacag 
cctgaaaaac 
gcttcacagc 
acaccactcg 
gcgagtatat 
aaaccgcacg 
cgtcacttcc 
cacatacaag 
cacgtcacaa 
attgatgatg 



cctaactctg 
tattgaggag 
tccggatcaa 
aatgttaagt 
caagtgcttt 
cgagggcccg 
tcgggagttt 
tgtgatttgc 
cttacacttt 
tttatttttc 
ccacatagct 
ctgccacctc 
agcatcatat 
cgagccaaac 
tcgctgtcca 
gaaggagaag 
tggtgctgca 
aacatggcag 
ctccgggcac 
accacaatat 
accacagaac 
ataaacacgc 
taccatataa 
aaaacctgcc 
gcccaggact 
aggcacacgt 
cagggaacaa 
taactcacgt 
atggtagcgc 
cgagacaacc 
atatttcctg 
ccgcttagat 
ccccctggct 
caccgcagaa 
aggagcggga 
cctcaaaatg 
cagccaaaga 
cggccctcac 
acattccagc 
ctctaagcaa 
ccaccttcag 
tataagattc 
ggccagctga 
aaccttgaca 
cgtagccccg 
aaaaatcagg 
aaaggcaggt 
ctgcgggttt 
ctgtcttaca 
gtgaccgtaa 
gtccggagtc 
aaaagcgacc 
cccccatagg 
cctcctgcct 
ggcagcctaa 
acacggcacc 
ataggactaa 
cgaacctacg 
gttttcccac 
ttactccgcc 
actccacccc 



cagacctcgt 
tttgtgccat 
tttattccta 
ggagaggcag 
gcccgcgact 
gcgcacggcg 
acccagcgcc 
aactgtccta 
ttcatacatt 
aattgcagaa 
tatacagatc 
cctcccaaca 
catgggtaac 
gctcatcagt 
gctgctgagc 
tccacgccta 
gcagcgcgcg 
tggtctcctc 
agcagcgcac 
tgttcaaaat 
ccacgtggcc 
tggacataaa 
acctctgatt 
cgccggctat 
cgtaaccatg 
gcatacactt 
cccattcctg 
tgtgcattgt 
gggtttctgt 
gagatcgtgt 
aagcaaaacc 
cgctctgtgt 
tcgggttcta 
taagccacac 
agagctggaa 
aagatctatt 
acagataatg 
gtccaagtgg 
accttcaacc 
atcccgaata 
cctcaagcag 
aaaagcggaa 
acataatcgt 
aaagaaccca 
atgtaagctt 
caaagcctcg 
aagctccgga 
ctgcataaac 
acaggaaaaa 
aaaaactggt 
ataatgtaag 
gaaatagccc 
aggtataaca 
aggcaaaata 
cagtcagcct 
agctcaatca 
aaaatgacgt 
cccagaaacg 
gttacgtaac 
ctaaaaccta 
ctcattatca 



cctctgagcc 
cggtctactt 
actttgacgc 
agcaactgcg 
ccggtgagtt 
tccggcttac 
ccctgctagt 
accttggatt 
gcccaagaat 
aatttcaagt 
accgtacctt 
cacagagtac 
agacatattc 
gatattaata 
cacaggctgc
catgggggta 
aataaactgc 
agcgatgatt 
cctgatctca 
cccacagtgc 
atcataccac 
cattacctct 
aaacatggcg 
acactgcagg 
gatcatcatg 
cctcaggatt 
aatcagcgta 
caaagtgtta 
ctcaaaagga 
tggtcgtagt 
aggtgcgggc 
agtagttgta 
tgtaaactcc 
ccagccaacc 
gaaccatgtt 
aagtgaacgc 
gcatttgtaa 
acgtaaaggc 
atgcccaaat 
ttaagtccgg 
cgaatcatga 
cattaacaaa 
gcaggtctgc 
cactgattat 
tgttgcatgg 
cgcaaaaaag 
accaccacag 
acaaaataaa 
caacccttat 
caccgtgatt 
actcggtaaa 
gggggaatac 
aaattaatag 
gcaccctccc 
taccagtaaa 
gtcacagtgt 
aacggttaaa 
aaagccaaaa 
ttcccatttt 
cgtcacccgc 
tattggcttc 



gcgctctgga 
taaccccttc 
ggtaaaggac 
cctgaaacac 
ttgctacttt 
cgcccaggga 
tgagcgggac 
acatcaagat 
aaagaatcgt 
catttttcat 
aatcaaactc 
acagtccttt 
ttaggtgtta 
aactccccgg 
tgtccaactt 
gagtcataat 
tgccgccgcc 
cgcaccgccc 
cttaaatcag 
aaggcgctgt 
aagcgcaggt 
tttggcatgt 
ccatccacca 
gaaccgggac 
ctcgtcatga 
acaagctcct 
aatcccacac 
cattcgggca 
ggtagacgat 
gtcatgccaa 
gtgacaaaca 
gtatatccac 
ttcatgcgcc 
tacacattcg 
ttttttttta 

gctcccctcc 
gatgttgcac 
taaacccttc 
aattctcatc 
ccattgtaaa 
ttgcaaaaat 
aataccgcga 
acggaccagc 
gacacgcata 
gcggcgatat 
aaagcacatc 
aaaaagacac 
ataacaaaaa 
aagcataaga 
aaaaagcacc 
cacatcaggt 
atacccgcag 
gagagaaaaa 
gctccagaac 
aaagaaaacc 
aaaaaagggc 
gtccacaaaa 
aacccacaac 
aagaaaacta 
cccgttccca 
aatccaaaat 



28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32480 
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<210> 24 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 

<400> 24

ctcaacaatt gtggatccgt actcc 25

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial Sequence



<223> Description of Artificial Sequence: primer

<400> 25

gtgctcagca gatcttgcga ctgtg 25



<210> 26 
<211> 25 
<212> DNA 

<213> Artificial Sequence

<223> Description of Artificial Sequence: primer 

<400> 26

ggcgcgttcg gatccactct cttcc 25

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: primer

<400> 27

ctacatgcta ggcagatctc gttcggag 28



<210> 28 

<211> 1240 

<212> DNA 

<213> adenovirus



<400> 28 

ggatccactc tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct 60
ctgaaaagcg ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt 120
gatattcacc tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa 180
gacaatcttt ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt tggacagcaa 240
cttggcgatg gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat 300
gtttagctgc acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc 360
gtcgggcacc aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt caacgctggt 420
ggctacctct ccgcgtaggc gctcgttggt ccagcagagg cggccgccct tgcgcgagca 480
gaatggcggt agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac 540
cccgggcagc aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt
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ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat 660 
agggtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 720 
gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 
gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc 840 
tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 900 
gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 
gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 1020 
ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 
gacaaactct tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 1140
agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 1200 
tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct 1240 

<210> 29 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: plasmid pDV60 



<400> 29 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900
gagctcggat ccactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta 960
ctccctctga aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga 1020
ggatttgata ttcacctggc ccgcggtgat gcctttgagg gtggccgcat ccatctggtc 1080
agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga 1140
cagcaacttg gcgatggagc gcagggtttg gtttttgtcg cgatcggcgc gctccttggc 1200
cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt 1260
gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac 1320
gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc cgcccttgcg 1380
cgagcagaat ggcggtaggg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt 1440
aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag 1500
cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca 1560
tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg 1620
ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac 1680
gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg 1740
ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg 1800
acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc 1860
gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta 1920
gtccagggtt tccttgatga tgtcatactt atcctgtccc ttttttttcc acagctcgcg 1980
gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtcggcctc 2040
cgaacgagat ccgtactccg ccgccgaggg acctgagcga gtccgcatcg accggatcgg 2100
aaaacctctc gagaaaggcg tctaaccagt cacagtcgca agatccaaga tgaagcgcgc 2160
aagaccgtct gaagatacct tcaaccccgt gtatccatat gacacggaaa ccggtcctcc 2220
aactgtgcct tttcttactc ctccctttgt atcccccaat gggtttcaag agagtccccc 2280
tggggtactc tctttgcgcc tatccgaacc tctagttacc tccaatggca tgcttgcgct 2340
caaaatgggc aacggcctct ctctggacga ggccggcaac cttacctccc aaaatgtaac 2400
cactgtgagc ccacctctca aaaaaaccaa gtcaaacata aacctggaaa tatctgcacc 2460
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cctcacagtt 
caacacactc 
tgccacccaa 
cctcaccacc 
cactggtagc 
actaaagtac 
tggtccaggt 
ttttgattca 
aaacagacgc 
aagactagga 
caaaggcctt 
cactgccaag 
tgaatttggt 
cctagaattt 
cagcacaggt 
accagctcca 
cttaacaaaa 
tttggctcca 
tggagtgcta 
tcttactgaa 
tccaaaatct 
agacaaaact 
cacaactcca 
taatgaaata 
gcggccgctc 
ctcgctgatc 
ccgtgccttc 
aaattgcatc 
acagcaaggg 
tggcttctga 
gcggcgcatt 
gcgccctagc 
ttccccgtca 
acctcgaccc 
agacggtttt 
aaactggaac 
ggatttcggc 
tctgtggaat 
gtatgcaaag 
cagcaggcag 
taactccgcc 
gactaatttt 
agtagtgagg 
tatccatttt 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 
cccgtgatat 
gtatcgccgc 
gagcgggact 
tttcgattcc 
cggctggatg 
gtttattgca 
agcatttttt 



acctcagaag 
accatgcaat 
ggacccctca 
accgatagca 
ttgggcattg 
ggggctcctt 
gtgactatta 
caaggcaata 
cttatacttg 
cagggccctc 
tacttgttta 
gggttgatgt 
tcacctaatg 
gattcaaaca 
gccattacag 
tctcctaact 
tgtggcagtc 
atatctggaa 
ctaaacaatt 
ggcacagcct
cacggtaaaa 
aaacctgtaa 
agtgcatact 
tttgccacat 
gagcatgcat 
agcctcgact 
cttgaccctg 
gcattgtctg 
ggaggattgg 
ggcggaaaga 
aagcgcggcg 
gcccgctcct 
agctctaaat 
caaaaaactt 
tcgccctttg 
aacactcaac 
ctattggtta 
gtgtgtcagt 
catgcatctc 
aagtatgcaa 
catcccgccc 
ttttatttat 
aggctttttt 
cggatctgat 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 
aagggactgg 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 
ctgtggccgg 
tgctgaagag 
tcccgattcg 
ctggggttcg 
accgccgcct 
atcctccagc 
gcttataatg 
tcactgcatt 



ccctaactgt 
cacaggcccc 
cagtgtcaga 
gtacccttac 
acttgaaaga 
tgcatgtaac 
ataatacttc 
tgcaacttaa 
atgttagtta 
tttttataaa 
cagcttcaaa 
ttgacgctac 
caccaaacac 
aggctatggt 
taggaaacaa 
gtagactaaa 
aaatacttgc 
cagttcaaag 
ccttcctgga 
atacaaacgc 
ctgccaaaag 
cactaaccat 
ctatgtcatt 
cctcttacac 
ctagagggcc 
gtgccttcta 
gaaggtgcca 
agtaggtgtc 
gaagacaata 
accagctggg 
ggtgtggtgg 
ttcgctttct 
cggggcatcc 
gattagggtg 
acgttggagt 
cctatctcgg 
aaaaatgagc 
tagggtgtgg 
aattagtcag 
agcatgcatc 
ctaactccgc 
gcagaggccg 
ggaggcctag 
caagagacag 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
aaatgaccga 
tctatgaaag 
gcggggatct 
gttacaaata 
ctagttgtgg 



ggctgccgcc 
gctaaccgtg 
aggaaagcta 
tatcactgcc 
gcccatttat 
agacgaccta 
cttgcaaact 
tgtagcagga 
tccgtttgat 
ctcagcccac 
caattccaaa 
agccatagcc 
aaatcccctc 
tcctaaacta 
aaataatgat 
tgcagagaaa 
tacagtttca 
tgctcatctt 
cccagaatat 
tgttggattt 
taacattgtc 
tacactaaac 
ttcatgggac 
tttttcatac 
ctattctata 
gttgccagcc 
ctcccactgt 
attctattct 
gcaggcatgc 
gctctagggg 
ttacgcgcag 
tcccttcctt 
ctttagggtt 
atggttcacg 
ccacgttctt 
tctattcttt 
tgatttaaca 
aaagtcccca 
caaccaggtg 
tcaattagtc 
ccagttccgc 
aggccgcctc 
gcttttgcaa 
gatgaggatc 
gggtggagag 
ccgtgttccg
gtgccctgaa 
ttccttgcgc 
gcgaagtgcc 
tcatggctga 
accaagcgaa 
aggatgatct 
aggcgcgcat 
atatcatggt 
cggaccgcta 
aatgggctga 
ccttctatcg 
ccaagcgacg 
gttgggcttc 
catgctggag 
aagcaatagc 
tttgtccaaa 



gcacctctaa 
cacgactcca 
gccctgcaaa 
tcaccccctc 
acacaaaatg 
aacactttga 
aaagttactg 
ggactaagga 
gctcaaaacc 
aacttggata 
aagcttgagg 
attaatgcag 
aaaacaaaaa 
ggaactggcc 
aagctaactt 
gatgctaaac 
gttttggctg 
attataagat 
tggaacttta 
atgcctaacc 
agtcaagttt 
ggtacacagg 
tggtctggcc 
attgcccaag 
gtgtcaccta 
atctgttgtt 
cctttcctaa 
ggggggtggg
tggggatgcg 
gtatccccac 
cgtgaccgct 
tctcgccacg 
ccgatttagt 
tagtgggcca 
taatagtgga 
tgatttataa 
aaaatttaac 
ggctccccag 
tggaaagtcc 
agcaaccata 
ccattctccg 
tgcctctgag 
aaagctcccg 
gtttcgcatg 
gctattcggc 
gctgtcagcg 
tgaactgcag 
agctgtgctc 
ggggcaggat 
tgcaatgcgg 
acatcgcatc 
ggacgaagag 
gcccgacggc 
ggaaaatggc 
tcaggacata 
ccgcttcctc 
ccttcttgac 
cccaacctgc 
ggaatcgttt 
ttcttcgccc 
atcacaaatt 
ctcatcaatg 



tggtcgcggg 
aacttagcat 
catcaggccc 
taactactgc 
gaaaactagg 
ccgtagcaac 
gagccttggg 
ttgattctca 
aactaaatct 
ttaactacaa 
ttaacctaag 
gagatgggct 
ttggccatgg 
ttagttttga 
tgtggaccac 
tcactttggt 
ttaaaggcag 
ttgacgaaaa 
gaaatggaga 
tatcagctta 
acttaaacgg 
aaacaggaga 
acaactacat 
aataaaagaa 
aatgctagag 
tgcccctccc 
taaaatgagg 
gtggggcagg 
gtgggctcta 
gcgccctgta 
acacttgcca 
ttcgccggct 
gctttacggc 
tcgccctgat 
ctcttgttcc 
gggattttgg 
gcgaattaat 
gcaggcagaa 
ccaggctccc 
gtcccgcccc 
ccccatggct 
ctattccaga 
ggagcttgta 
attgaacaag 
tatgactggg 
caggggcgcc 
gacgaggcag 
gacgttgtca 
ctcctgtcat 
cggctgcata 
gagcgagcac 
catcaggggc 
gaggatctcg 
cgcttttctg 
gcgttggcta 
gtgctttacg 
gagttcttct 
catcacgaga 
tccgggacgc 
accccaactt 
tcacaaataa 
tatcttatca 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
tgtctgtata

tgtgtgaaat 

taaagcctgg 

cgctttccag 

gagaggcggt 

ggtcgttcgg 

agaatcaggg 

ccgtaaaaag 

caaaaatcga 

gtttccccct 

cctgtccgcc 

tctcagttcg 

gcccgaccgc 

cttatcgcca 

tgctacagag 

tatctgcgct 

caaacaaacc 

aaaaaaagga 

cgaaaactca 

ccttttaaat 

tgacagttac 

atccatagtt 

tggccccagt 

aataaaccag 

catccagtct 

gcgcaacgtt 

ttcattcagc 

aaaagcggtt 

atcactcatg 

cttttctgtg 

gagttgctct 

agtgctcatc 

gagatccagt 

caccagcgtt 

ggcgacacgg 

tcagggttat 

aggggttccg 



ccgtcgacct 

tgttatccgc 

ggtgcctaat 

tcgggaaacc 

ttgcgtattg 

ctgcggcgag 

gataacgcag 

gccgcgttgc 

cgctcaagtc 

ggaagctccc 

tttctccctt 

gtgtaggtcg 

tgcgccttat 

ctggcagcag 

ttcttgaagt 

ctgctgaagc 

accgctggta 

tctcaagaag 

cgttaaggga 

taaaaatgaa 

caatgcttaa 

gcctgactcc 

gctgcaatga 

ccagccggaa 

attaattgtt 

gttgccattg 

tccggttccc 

agctccttcg 

gttatggcag 

actggtgagt 

tgcccggcgt 

attggaaaac 

tcgatgtaac 

tctgggtgag 

aaatgttgaa 

tgtctcatga 

cgcacatttc 



ctagctagag 
tcacaattcc 
gagtgagcta 
tgtcgtgcca 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 
gttttaaatc 
tcagtgaggc 
ccgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
actcaaccaa 
caatacggga 
gttcttcggg 
ccactcgtgc 
caaaaacagg 
tactcatact 
gcggatacat 
cccgaaaagt 



cttggcgtaa 

acacaacata 

actcacatta 

gctgcattaa 

cgcttcctcg 

tcactcaaag 

gtgagcaaaa 

ccataggctc 

aaacccgaca 

tcctgttccg 

ggcgctttct 

gctgggctgt 

tcgtcttgag 

caggattagc 

ctacggctac 

cggaaaaaga 

ttttgtttgc 

cttttctacg 

gagattatca 

aatctaaagt 

acctatctca 

gataactacg 

cccacgctca 

cagaagtggt 

tagagtaagt 

cgtggtgtca 

gcgagttaca 

cgttgtcaga 

ttctcttact 

gtcattctga 

taataccgcg 

gcgaaaactc 

acccaactga 

aaggcaaaat 

cttccttttt 

atttgaatgt 

gccacctgac 



tcatggtcat 

cgagccggaa 

attgcgttgc 

tgaatcggcc 

ctcactgact 

gcggtaatac 

ggccagcaaa 

cgcccccctg 

ggactataaa 

accctgccgc 

caatgctcac 

gtgcacgaac 

tccaacccgg 

agagcgaggt 

actagaagga 

gttggtagct 

aagcagcaga 

gggtctgacg 

aaaaggatct 

atatatgagt 

gcgatctgtc 

atacgggagg 

ccggctccag 

cctgcaactt 

agttcgccag 

cgctcgtcgt 

tgatccccca 

agtaagttgg 

gtcatgccat 

gaatagtgta 

ccacatagca 

tcaaggatct 

tcttcagcat 

gccgcaaaaa 

caatattatt 

atttagaaaa 

gtc 



agctgtttcc 
gcataaagtg 
gctcactgcc 
aacgcgcggg 
cgctgcgctc 
ggttatccac 
aggccaggaa 
acgagcatca 
gataccaggc 
ttaccggata 
gctgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
ttacgcgcag 
ctcagtggaa 
tcacctagat 
aaacttggtc 
tatttcgttc 
gcttaccatc 
atttatcagc 
tatccgcctc 
ttaatagttt 
ttggtatggc 
tgttgtgcaa 
ccgcagtgtt 
ccgtaagatg 
tgcggcgacc 
gaactttaaa 
taccgctgtt 
cttttacttt 
agggaataag 
gaagcattta 
ataaacaaat 



6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8383 



<210> 30 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 



<220>

<223> Description of Artificial Sequence: plasmid pDV67



<400> 30

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 



gagatctccc 

aagccagtat 

ttaagctaca

gcgttttgcg 

tagttattaa 

cgttacataa 

gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 



gatcccctat 

ctgctccctg 

acaaggcaag 

ctgcttcgcg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 



ggtcgactct 

cttgtgtgtt 

gcttgaccga 

atgtacgggc 

ttacggggtc 

atggcccgcc 

ttcccatagt 

aaactgccca 

tcaatgacgg 

ctacttggca 

agtacatcaa 

ttgacgtcaa 

acaactccgc 

gcagagctct 

ctcactatag 



cagtacaatc 

ggaggtcgct 

caattgcatg 

cagatatacg 

attagttcat 

tggctgaccg 

aacgccaata 

cttggcagta 

taaatggccc 

gtacatctac 

tgggcgtgga 

tgggagtttg 

cccattgacg 

ctggctaact 

ggagacccaa 



tgctctgatg 60 
gagtagtgcg 120 
aagaatctgc 180 
cgttgacatt 240 
agcccatata 300 
cccaacgacc 360 
gggactttcc 420 
catcaagtgt 480 
gcctggcatt 540 
gtattagtca 600 
tagcggtttg 660 
ttttggcacc 720 
caaatgggcg 780 
agagaaccca 840 
gctggctagc 900
gtttaaactt
gggccagctg 
tgtcagtttc 
gggtggccgc 
acgacccgta 
cgcgatcggc 
gccattcggg 
tgtgcagggt 
agcagaggcg 
ccggggggtc 
tcttgcatcc 
atgggttgag 
aaatgtcgta 
caccgcggat 
gaccgaggtt 
gtgagttgga 
ccgcgtcacg 
cctgcacgtc 
cctttttttt 
ggatcggaaa 
gagtccgcat 
caagatccaa 
atgacacgga 
atgggtttca 
cctccaatgg 
accttacctc 
taaacctgga 
ccgcacctct 
tgcacgactc 
tagccctgca 
cctcaccccc 
atacacaaaa 
taaacacttt 
ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aagatgctaa 
cagttttggc 
ttattataag 
attggaactt 
ttatgcctaa 
tcagtcaagt 
acggtacaca 
actggtctgg 
acattgccca 
gatcagcctc 
cttccttgac 
catcgcattg 
agggggagga 
ctgaggcgga 
cattaagcgc 
tagcgcccgc 
gtcaagctct 
accccaaaaa 
tttttcgccc 
gaacaacact 



aagcttggta 
ttggggtgag 
caaaaacgag 
atccatctgg 
gagggcgttg 
gcgctccttg 
aaagacggtg 
gacaaggtca 
gccgcccttg 
tgcgtccacg 
ttgcaagtct 
tgggggaccc 
aacgtagagg 
gctggcgcgc 
gctacgggcg 
tgatatggtt 
cacgaaggag 
tagggcgcag 
ccacagctcg 
cccgtcggcc 
cgaccggatc 
gatgaagcgc 
aaccggtcct 
agagagtccc 
catgcttgcg 
ccaaaatgta 
aatatctgca 
aatggtcgcg 
caaacttagc 
aacatcaggc 
tctaactact 
tggaaaacta 
gaccgtagca 
tggagccttg
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccttagtttt 
tttgtggacc 
actcactttg 
tgttaaaggc 
atttgacgaa 
tagaaatgga 
cctatcagct 
ttacttaaac 
ggaaacagga 
ccacaactac 
agaataaaag 
gactgtgcct 
cctggaaggt 
tctgagtagg 
ttgggaagac 
aagaaccagc 
ggcgggtgtg 
tcctttcgct 
aaatcggggc 
acttgattag 
tttgacgttg 
caaccctatc 



ccgagctcgg 
tactccctct 
gaggatttga 
tcagaaaaga 
gacagcaact 
gccgcgatgt 
gtgcgctcgt 
acgctggtgg 
cgcgagcaga 
gtaaagaccc 
agcgcctgct 
catggcatgg 
ggctctctga 
acgtaatcgt 
ggctgctctg 
ggacgctgga 
gcgtaggagt 
tagtccaggg 
cggttgagga 
tccgaacgag 
ggaaaacctc 
gcaagaccgt 
ccaactgtgc
cctggggtac 
ctcaaaatgg 
accactgtga 
cccctcacag 
ggcaacacac 
attgccaccc 
cccctcacca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
tctagttgcc 
gccactccca 
tgtcattcta 
aatagcaggc 
tggggctcta 
gtggttacgc 
ttcttccctt 
atccctttag 
ggtgatggtt 
gagtccacgt 
tcggtctatt 



atccactctc 
gaaaagcggg 
tattcacctg 
caatcttttt 
tggcgatgga 
ttagctgcac 
cgggcaccag 
ctacctctcc 
atggcggtag 
cgggcagcag 
gccatgcgcg 
ggtgggtgag 
gtattccaag 
atagttcgtg 
ctcggaagac 
agacgttgaa 
cgcgcagctt 
tttccttgat 
caaactcttc 
atccgtactc 
tcgagaaagg 
ctgaagatac 
cttttcttac 
tctctttgcg 
gcaacggcct 
gcccacctct 
ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
agccatctgt 
ctgtcctttc 

ttctgggggg 

atgctgggga 

gggggtatcc

gcagcgtgac 

cctttctcgc 

ggttccgatt 

cacgtagtgg 

tctttaatag 

cttttgattt 



ttccgcatcg 
catgacttct 
gcccgcggtg 
gttgtcaagc 
gcgcagggtt 
gtattcgcgc 
gtgcacgcgc 
gcgtaggcgc 
ggggtctagc 
gcgcgcgtcg 
ggcggcaagc 
cgcggaggcg 
atatgtaggg 
cgagggagcg 
tatctgcctg 
gctggcgtct 
gttgaccagc 
gatgtcatac 
gcggtctttc 
cgccgccgag 
cgtctaacca 
cttcaacccc 
tcctcccttt 
cctatccgaa 
ctctctggac 
caaaaaaacc 
agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
agggcccgtt 
tgtttgcccc 
ctaataaaat 
tggggtgggg 
tgcggtgggc 
ccacgcgccc 
cgctacactt 
cacgttcgcc 
tagtgcttta 
gccatcgccc 
tggactcttg 
ataagggatt 



ctgtctgcga 
gcgctaagat 
atgcctttga 
ttggtggcaa 
tggtttttgt 
gcaacgcacc 
caaccgcggt 
tcgttggtcc 
tgcgtctcgt 
aagtagtcta 
gcgcgctcgt 
tacatgccgc 
tagcatcttc 
aggaggtcgg 
aagatggcat 
gtgagaccta 
tcggcggtga 
ttatcctgtc 
cagtactctt 
ggacctgagc 
gtcacagtcg 
gtgtatccat 
gtatccccca 
cctctagtta 
gaggccggca 
aagtcaaaca 
gtggctgccg 
ccgctaaccg 
gaaggaaagc 
actatcactg 
gagcccattt 
acagacgacc 
tccttgcaaa 
aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 

actttttcat 
taaacccgct 
tcccccgtgc 
gaggaaattg 
caggacagca 
tctatggctt 
tgtagcggcg 
gccagcgccc 
ggctttcccc 
cggcacctcg 
tgatagacgg 
ttccaaactg 
ttggggattt 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
cggcctattg
gaatgtgtgt 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
ttttcggatc 
ataatacgac 
caccgcgcgc 
ggacttcgtg 
cgcggtccag 
ggacgagctg 
gccggccatg 
ggccggcaac 
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



gttaaaaaat 
cagttagggt 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
tgatcagcac 
aaggtgagga 
gacgtcgccg 
gaggacgact 
gaccaggtgg 
tacgccgagt 
accgagatcg 
tgcgtgcact 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 

gggtgagcaa

tgttgaatac 
ctcatgagcg 
acatttcccc 



gagctgattt 
gtggaaagtc 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
gtgttgacaa 
actaaaccat 
gagcggtcga 
tcgccggtgt 
tgccggacaa 
ggtcggaggt 
gcgagcagcc 
tcgtggccga 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aacaaaaatt 
cccaggctcc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctctgcctc 
gcaaaaagct 
ttaatcatcg 
ggccaagttg 
gttctggacc 
ggtccgggac 
caccctggcc 
cgtgtccacg 
gtgggggcgg 
ggagcaggac 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcaa 
gggctgtgtg 
tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



taacgcgaat 
ccaggcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
cccgggagct 
gcatagtata 
accagtgccg 
gaccggctcg 
gacgtgaccc 
tgggtgtggg 
aacttccggg 
gagttcgccc 
tgacacgtgc 
atcgttttcc 
ttcgcccacc 
acaaatttca 
atcaatgtat 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
tgctcacgct 
cacgaacccc 
aacccggtaa 
gcgaggtatg 
agaaggacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



taattctgtg 4680 
agaagtatgc 4740 
tccccagcag 4800 
cccctaactc 4860 
ggctgactaa 4920 
cagaagtagt 4980 
tgtatatcca 5040 
tcggcatagt 5100 
ttccggtgct 5160 
ggttctcccg 5220 
tgttcatcag 5280 
tgcgcggcct 5340 
acgcctccgg 5400 
tgcgcgaccc 5460 
tacgagattt 5520 
gggacgccgg 5580 
ccaacttgtt 5640 
caaataaagc 5700 
cttatcatgt 5760 
tgtttcctgt 5820 
taaagtgtaa 5880 
cactgcccgc 5940 
gcgcggggag 6000 
tgcgctcggt 6060 
tatccacaga 6120 
ccaggaaccg 6180 
agcatcacaa 6240 
accaggcgtt 6300 
ccggatacct 6360 
gtaggtatct 6420 
ccgttcagcc 6480 
gacacgactt 6540 
taggcggtgc 6600 
tatttggtat 6660 
gatccggcaa 6720 
cgcgcagaaa 6780 
agtggaacga 6840 
cctagatcct 6900 
cttggtctga 6960 
ttcgttcatc 7020 
taccatctgg 7080 
tatcagcaat 7140 
ccgcctccat 7200 
atagtttgcg 7260 
gtatggcttc 7320 
tgtgcaaaaa 7380 
cagtgttatc 7440
taagatgctt 7500
ggcgaccgag 7560
ctttaaaagt 7620 
cgctgttgag 7680 
ttactttcac 7740 
gaataagggc 7800 
gcatttatca 7860 
aacaaatagg 7920 
7960 



<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<223> Description of Artificial Sequence: primer
<400> 31 

atgggatcca agatgaagcg cgcaagaccg 30

<210> 32
<211> 30
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: primer

<400> 32

cactatagcg gccgcattct cagtcatctt 30

<210> 33
<211> 7989
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: plasmid pDV69



<400> 33 

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gtttaaactt 

gggccagctg 

tgtcagtttc 

gggtggccgc 

acgacccgta 

cgcgatcggc 

gccattcggg 

tgtgcagggt 

agcagaggcg 

ccggggggtc 

tcttgcatcc 

atgggttgag 

aaatgtcgta 

caccgcggat 

gaccgaggtt 

gtgagttgga 

ccgcgtcacg 

cctgcacgtc 

cctttttttt 

ggatcggaaa 

gagtccgcat 



gagatctccc 

aagccagtat 

ttaagctaca 

gcgttttgcg 

tagttattaa 

cgttacataa 

gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 

aagcttggta 

ttggggtgag 

caaaaacgag 

atccatctgg 

gagggcgttg 

gcgctccttg 

aaagacggtg 

gacaaggtca 

gccgcccttg 

tgcgtccacg 

ttgcaagtct 

tgggggaccc 

aacgtagagg 

gctggcgcgc 

gctacgggcg 

tgatatggtt 

cacgaaggag 

tagggcgcag 

ccacagctcg 

cccgtcggcc 

cgaccggatc 



gatcccctat 

ctgctccctg 

acaaggcaag 

ctgcttcgcg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 

ccgagctcgg 

tactccctct 

gaggatttga 

tcagaaaaga 

gacagcaact 

gccgcgatgt 

gtgcgctcgt 

acgctggtgg 

cgcgagcaga 

gtaaagaccc 

agcgcctgct 

catggcatgg 

ggctctctga 

acgtaatcgt 

ggctgctctg 

ggacgctgga 

gcgtaggagt 

tagtccaggg 

cggttgagga 

tccgaacgag 

ggaaaacctc 



ggtcgactct 

cttgtgtgtt 

gcttgaccga 

atgtacgggc 

ttacggggtc 

atggcccgcc 

ttcccatagt 

aaactgccca 

tcaatgacgg 

ctacttggca 

agtacatcaa 

ttgacgtcaa 

acaactccgc 

gcagagctct 

ctcactatag 

atccactctc 

gaaaagcggg 

tattcacctg 

caatcttttt 

tggcgatgga 

ttagctgcac 

cgggcaccag 

ctacctctcc 

atggcggtag 

cgggcagcag 

gccatgcgcg 

ggtgggtgag 

gtattccaag 

atagttcgtg 

ctcggaagac 

agacgttgaa 

cgcgcagctt 

tttccttgat 

caaactcttc 

atccgtactc 

tcgagaaagg 



cagtacaatc 

ggaggtcgct 

caattgcatg 

cagatatacg 

attagttcat 

tggctgaccg 

aacgccaata 

cttggcagta 

taaatggccc 

gtacatctac 

tgggcgtgga 

tgggagtttg 

cccattgacg 

ctggctaact 

ggagacccaa 

ttccgcatcg 

catgacttct 

gcccgcggtg 

gttgtcaagc 

gcgcagggtt 

gtattcgcgc 

gtgcacgcgc 

gcgtaggcgc 

ggggtctagc 

gcgcgcgtcg 

ggcggcaagc 

cgcggaggcg 

atatgtaggg 

cgagggagcg 

tatctgcctg 

gctggcgtct 

gttgaccagc 

gatgtcatac 

gcggtctttc 

cgccgccgag 

cgtctaacca 



tgctctgatg 60 
gagtagtgcg 120 
aagaatctgc 180 
cgttgacatt 240 
agcccatata 300 
cccaacgacc 360 
gggactttcc 420 
catcaagtgt 480 
gcctggcatt 540 
gtattagtca 600 
tagcggtttg 660 
ttttggcacc 720 
caaatgggcg 780 
agagaaccca 840 
gctggctagc 900 
ctgtctgcga 960 
gcgctaagat 1020 
atgcctttga 1080 
ttggtggcaa 1140 
tggtttttgt 1200 
gcaacgcacc 1260 
caaccgcggt 1320 
tcgttggtcc 1380 
tgcgtctcgt 1440 
aagtagtcta 1500 
gcgcgctcgt 1560 
tacatgccgc 1620 
tagcatcttc 1680 
aggaggtcgg 1740 
aagatggcat 1800 
gtgagaccta 1860 
tcggcggtga 1920 
ttatcctgtc 1980 
cagtactctt 2040 
ggacctgagc 2100 
gtcacagtcg 2160 
caagatccaa
atgacacgga 
atgggtttca 
cctccaatgg 
accttacctc 
taaacctgga 
ccgcacctct 
tgcacgactc 
tagccctgca 
cctcaccccc 
atacacaaaa 
taaacacttt 
ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aacaaaaccc 
gatatgtaac 
tctccattaa 
ctcttaaaac 
ttatgccaag 
attatatttt 
ttactgttat 
tatggtcctt 
catttacctt 
tagagggccc 
tgttgtttgc 
ttcctaataa 
gggtggggtg 
ggatgcggtg 
ggggtatccc
cagcgtgacc 
ctttctcgcc 
gttccgattt 
acgtagtggg 
ctttaatagt 
ttttgattta 

acaaaaattt 

ccaggctccc 

gtgtggaaag 

gtcagcaacc 

cgcccattct 

ctctgcctct 

caaaaagctc 

taatcatcgg 

gccaagttga 

ttctggaccg 

gtccgggacg 

accctggcct 

gtgtccacga 

tgggggcggg 

gagcaggact 

ggcttcggaa 

ctggagttct 

aatagcatca 

tccaaactca

gcgtaatcat 



gatgaagcgc 
aaccggtcct 
agagagtccc 
catgcttgcg 
ccaaaatgta 
aatatctgca 
aatggtcgcg 
caaacttagc 
aacatcaggc 
tctaactact 
tggaaaacta 
gaccgtagca 
tggagccttg 
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccttagtttt 

tttgtggacc 
agatagcaaa 
gctaatggga 
tgtagaacta 
agatctagaa 
tactacagcg 
tggtcaatgc 
gcttaataaa 
gaatgctggt 
ttcctatatt 
gtttaaaccc 
ccctcccccg 
aatgaggaaa 
gggcaggaca 
ggctctatgg 
cacgcgccct 
gctacacttg 
acgttcgccg 
agtgctttac 
ccatcgccct 
ggactcttgt 
taagggattt 
aacgcgaatt 
caggcaggca 
tccccaggct 
atagtcccgc 
ccgccccatg 
gagctattcc 
ccgggagctt 
catagtatat 
ccagtgccgt 
accggctcgg 
acgtgaccct 
gggtgtgggt 
acttccggga 
agttcgccct 
gacacgtgct 
tcgttttccg 
tcgcccaccc 
caaatttcac 
tcaatgtatc 
ggtcatagct 



gcaagaccgt 
ccaactgtgc 
cctggggtac 
ctcaaaatgg 
accactgtga 
cccctcacag 
ggcaacacac 
attgccaccc 
cccctcacca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
ggtccaaaac 
ctaactttaa 
gcctcagact 
tactttgatg 
ctaaaataca 
tatccatttg 
tactacaaag 
cgcctgccag 
ctagctccag 
agagaagatg 
gctgatcagc 
tgccttcctt 
ttgcatcgca 
gcaaggggga 
cttctgaggc 
gtagcggcgc 
ccagcgccct 
gctttccccg 
ggcacctcga 
gatagacggt 
tccaaactgg 
tggggatttc 
aattctgtgg 
gaagtatgca 
ccccagcagg 
ccctaactcc 
gctgactaat 
agaagtagtg 
gtatatccat 
cggcatagta 
tccggtgctc 
gttctcccgg 
gttcatcagc 
gcgcggcctg 
cgcctccggg 
gcgcgacccg 
acgagatttc 
ggacgccggc 
caacttgttt 
aaataaagca 
ttatcatgtc 
gtttcctgtg 



ctgaagatac 
cttttcttac 
tctctttgcg 
gcaacggcct 
gcccacctct 
ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
cagaagccaa 
tccttgtaaa 
acgttaacac 
ccactggtca 
agcaaaccgc 
tccttcctaa 
caagcgatgg 
atagtcgcac 
aaactactca 
actgattttt 
ctcgactgtg 
gaccctggaa 
ttgtctgagt 
ggattgggaa 
ggaaagaacc 
attaagcgcg 
agcgcccgct 
tcaagctcta 
ccccaaaaaa 
ttttcgccct 
aacaacactc 
ggcctattgg 
aatgtgtgtc 
aagcatgcat 
cagaagtatg 
gcccatcccg 
tttttttatt 

aggaggcttt 
tttcggatct 
taatacgaca 
accgcgcgcg 
gacttcgtgg 
gcggtccagg 
gacgagctgt 
ccggccatga 
gccggcaact 
gattccaccg 
tggatgatcc 
attgcagctt 
tttttttcac 

tgtataccgt 
tgaaattgtt 



cttcaacccc 
tcctcccttt 
cctatccgaa 
ctctctggac 
caaaaaaacc 
agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 

tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgcataatt 
aaatggagga 
cttatttaaa 
tatattacca 
tgactttagt 
tgcgggaaca 
tgcccttttt 
atcctatgtt 
ggcaaccctc 
aagaagcggc 
ccttctagtt 
ggtgccactc 
aggtgtcatt 
gacaatagca 
snccntagct 
gcgggtgtgg 
cctttcgctt 
aatcggggca 
cttgattagg 
ttgacgttgg 
aaccctatct 
ttaaaaaatg 
agttagggtg 
ctcaattagt 
caaagcatgc 
cccctaactc 
tatgcagagg 
tttggaggcc 
gatcagcacg 
aggtgaggaa 
acgtcgccgg 
aggacgactt 
accaggtggt 
acgccgagtg 
ccgagatcgg 
gcgtgcactt 
ccgccttcta 
tccagcgcgg 
ataatggtta 
tgcattctag 
cgacctctag 
atccgctcac 



gtgtatccat 2220 
gtatccccca 2280 
cctctagtta 2340 
gaggccggca 2400 
aagtcaaaca 2460 
gtggctgccg 2520
ccgctaaccg 2580 
gaaggaaagc 2640 
actatcactg 2700 
gagcccattt 2760 
acagacgacc 2820 
tccttgcaaa 2880 
aatgtagcag 2940 
tatccgtttg 3000 
aactcagccc 3060 
aacaattcca 3120 
acagccatag 3180 
acaaatcccc 3240 
gttcctaaac 3300 
aaaaataatg 3360 
gaatacggga 3420 
attgttaatg 3480 
aacaaaaatg 3540 
gactcatctt 3600 
gcaagaggtt 3660 
cataatgaaa 3720 
ccgttggaag 3780 
atgacttttt 3840 
ataacctccc 3900 
cgctcgagtc 3960 
gccagccatc 4020 
ccactgtcct 4080 
ctattctggg 4140 
ggcatgctgg 4200 
ggggctctag 4260
tggttacgcg 4320 
tcttcccttc 4380 
tccctttagg 4440 
gtgatggttc 4500 
agtccacgtt 4560 
cggtctattc 4620 
agctgattta 4680 
tggaaagtcc 4740 
cagcaaccag 4800 
atctcaatta 4860 
cgcccagttc 4920 
ccgaggccgc 4980 
taggcttttg 5040 
tgttgacaat 5100 
ctaaaccatg 5160 
agcggtcgag 5220 
cgccggtgtg 5280 
gccggacaac 5340 
gtcggaggtc 5400 
cgagcagccg 5460 
cgtggccgag 5520 
tgaaaggttg 5580 
ggatctcatg 5640 
caaataaagc 5700 
ttgtggtttg 5760 
ctagagcttg 5820 
aattccacac 5880 
aacatacgag

acattaattg 

cattaatgaa 

tcctcgctca 

tcaaaggcgg 

gcaaaaggcc 

aggctccgcc 

ccgacaggac 

gttccgaccc 

ctttctcaat 

ggctgtgtgc 

cttgagtcca 

attagcagag 

ggctacacta 

aaaagagttg 

gtttgcaagc 

tctacggggt 

ttatcaaaaa 

taaagtatat 

atctcagcga 

actacgatac 

cgctcaccgg 

agtggtcctg 

gtaagtagtt 

gtgtcacgct 

gttacatgat 

gtcagaagta 

cttactgtca 

ttctgagaat 

accgcgccac 

aaactctcaa

aactgatctt 

caaaatgccg 

ctttttcaat 

gaatgtattt 

cctgacgtc 



ccggaagcat 

cgttgcgctc 

tcggccaacg 

ctgactcgct 

taatacggtt 

agcaaaaggc 

cccctgacga 

tataaagata 

tgccgcttac 

gctcacgctg 

acgaaccccc 

acccggtaag 

cgaggtatgt 

gaaggacagt 

gtagctcttg 

agcagattac 

ctgacgctca 

ggatcttcac 

atgagtaaac 

tctgtctatt 

gggagggctt 

ctccagattt 

caactttatc 

cgccagttaa 

cgtcgtttgg 

cccccatgtt 

agttggccgc 

tgccatccgt 

agtgtatgcg 

atagcagaac 

ggatcttacc 

cagcatcttt 

caaaaaaggg 

attattgaag 

agaaaaataa 



aaagtgtaaa 

actgcccgct 

cgcggggaga 

gcgctcggtc 

atccacagaa 

caggaaccgt 

gcatcacaaa 

ccaggcgttt 

cggatacctg 

taggtatctc 

cgttcagccc 

acacgactta 

aggcggtgct 

atttggtatc 

atccggcaaa 

gcgcagaaaa 

gtggaacgaa 

ctagatcctt 

ttggtctgac 

tcgttcatcc 

accatctggc 

atcagcaata 

cgcctccatc 

tagtttgcgc 

tatggcttca 

gtgcaaaaaa 

agtgttatca 

aagatgcttt 

gcgaccgagt 

tttaaaagtg 

gctgttgaga 

tactttcacc 

aataagggcg 

catttatcag 

acaaataggg 



gcctggggtg 

ttccagtcgg 

ggcggtttgc 

gttcggctgc 

tcaggggata 

aaaaaggccg 

aatcgacgct 

ccccctggaa 

tccgcctttc 

agttcggtgt 

gaccgctgcg 

tcgccactgg 

acagagttct 

tgcgctctgc 

caaaccaccg 

aaaggatctc 

aactcacgtt 

ttaaattaaa 

agttaccaat 

atagttgcct 

cccagtgctg 

aaccagccag 

cagtctatta 

aacgttgttg 

ttcagctccg 

gcggttagct 

ctcatggtta 

tctgtgactg 

tgctcttgcc 

ctcatcattg 

tccagttcga 

agcgtttctg 

acacggaaat 

ggttattgtc 

gttccgcgca 



cctaatgagt 

gaaacctgtc 

gtattgggcg 

ggcgagcggt 

acgcaggaaa 

cgttgctggc 

caagtcagag 

gctccctcgt 

tcccttcggg 

aggtcgttcg 

ccttatccgg 

cagcagccac 

tgaagtggtg 

tgaagccagt 

ctggtagcgg 

aagaagatcc 

aagggatttt 

aatgaagttt 

gcttaatcag 

gactccccgt 

caatgatacc 

ccggaagggc 

attgttgccg 

ccattgctac 

gttcccaacg 

ccttcggtcc 

tggcagcact 

gtgagtactc 

cggcgtcaat 

gaaaacgttc 

tgtaacccac 

ggtgagcaaa 

gttgaatact 

tcatgagcgg 

catttccccg 



gagctaactc 

gtgccagctg 

ctcttccgct 

atcagctcac 

gaacatgtga 

gtttttccat 

gtggcgaaac 

gcgctctcct 

aagcgtggcg 

ctccaagctg 

taactatcgt 

tggtaacagg 

gcctaactac 

taccttcgga 

tggttttttt 

tttgatcttt 

ggtcatgaga

taaatcaatc 

tgaggcacct 

cgtgtagata 

gcgagaccca 

cgagcgcaga 

ggaagctaga

aggcatcgtg

atcaaggcga

tccgatcgtt

gcataattct 

aaccaagtca 

acgggataat 

ttcggggcga 

tcgtgcaccc 

aacaggaagg 

catactcttc 

atacatattt 

aaaagtgcca 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

7989 



<210> 34 
<211> 7607 
<212> DNA

<213> Artificial Sequence



<223> Description of Artificial Sequence: plasmid GRE5-El-SV40-Hygro 



<400> 34 

tctagaagat

ttctagctac 

gtacaggatg 

agatcgatct

gcgcgttcgt 

aacttcaggg 

atgttatatg 

tatcaccatg 

ttgtctcctc 

ttgtaacgaa 

tcactttttt 

acaattgtta 

ggaaatattc 

tggttacaat 

ccctctgcta 

gttattgtgc 



ccgctgtaca 

tttattagat 

ttctagctac 

cctggccgtt 

cctcactctc 

tgagtttggg 

gagggggcaa 

gaccctcatg 

ttattttctt 

tttttaaatt 

ttcaaggcaa 

taattaaatg 

ttattggtag 

gatatacact 

accatgttca 

tgtctcatca 



ggatgttcta 

ccgctgtaca 

tttattagat 

cggggtcaaa 

ttccgcatcg 

gacccttgat 

agttttcagg 

ataattttgt 

ttcattttct 

cacttttgtt 

tcagggtata 

ataaggtaga 

aaacaactac 

gtttgagatg 

tgccttcttc 

ttttggcaaa 



gctactttat

ggatgttcta 

ccgtgtacag 

aaccaggttt 

ctgtctgcga 

tgttctttct 

gtgttgttta 

ttctttcact 

gtaacttttt 

tatttgtcag 

ttatattgta 

atatttctgc 

atcctggtca 

aggataaaat 

tttttcctac 

gaattagatc 



tagatccgct 

gctactttat

gatgttctag 

ggctataaaa 

gggccaggat

ttttcgctat 

gaatgggaag 

ttctactctg 

cgttaaactt 

attgtaagta 

cttcagcaca 

atataaattc 

tcatcctgcc 

actctgagtc 

agctcctggg 

taagcttctg 



gtacaggatg 60 
tagatccgct 120 
ctactttatt 180
gggggtgggg 240 
cgatcctgag 300 
tgtaaaattc 360 
atgtcccttg 420 
ttgacaacca 480 
tagcttgcat 540
ctttctctaa 600 
gttttagaga 660 
tggctggcgt 720 
tttctcttta 780 
caaaccgggc 840 
caacgtgctg 900 
actcggtcga
ggccgccagt 
tagccatttt 
agatcccaac 
ggaagggatt 
ttcccggcag 
tgtaccggag 
ggatgaagag 
gtcttgtcat 
tatgaggacc 
agtggtgggt 
ttttgtattg 
gaaccggagc 
ccgacatcac 
tctaacacac 
gtgagagttg 
gggcaacctt 
cgtgtgtggt 
ataatgttta 
cgtgggctaa 
tctgctgtgc 
ctgtggggct 
tttgaagagc 
gcgcttttcc 
gctgctgttg 
ggggggtacc 
aatcgcctgc 
cagcagcagg 
ggcctggacc 
gcattttgac 
cttgtgaggc 
ctgagtgtat 
cgcagaagta 
aggaggctat 
gcaaacttgt 
tagatacgga 
ttggcatgga 
cggttttcct 
atacctgtgt 
ggaagggggt 
ggtgtacctt 
actgtggttg 
gcaactgcga 
tgaagaccat 
tactgacccg 
gcaatttgag 
acggggtgtt 
ccaggtgcag 
atgtgaccga 
gctctagcga 
gaaagaatat 
ccgccatgag 
tgcccccatg 
tcctgcccgc 
ctgcagcctc 
ttgctttcct 
agttgacggc 
agcagctgtt 
cggtttaaaa 
gctgtctcag 
ttaaagcaga 
aatttcacaa 



ctgaaaatga 
cttttggacc 
gaaccaccta 
gaggaggcgg 
gacttactca 
cccgagcagc 
gtgatcgatc 
ggtgaggagt 
tatcaccgga 
tgtggcatgt 
ttggtgtggt 
tgattttttt 
ctgcaagacc 
ctgtgtctag 
ctcctgagat 
gtgggcgtcg 
tggacttgag 
taacgccttt 
acttgcatgg 
tcttggttac 
gtaacttgct 
catcccaggc 
ttttgaaatc 
aagagaaggt 
cttttttgag 
tgctggattt 
tactgttgtc 
aggaagccag 
ctcgggaatg 
aattacagag 
tacagaggag 
tacttttcaa 
ttccatagag 
tagggtatat 
aaatatcagg 
ggatagggtg 
cggggtggtt 
ggccaatacc 
ggaagcctgg 
ggtgtgtcgc 
gggtatcctg 
cttcatgcta 
ggacagggcc 
tcacgtagcc 
ctgttccttg 
tcacactaag 
tgacatgacc 
accctgcgag 
ggagctgagg 
tgaagataca 
ataaggtggg 
caccaactcg 
ggccggggtg 
aaactctact 
cgccgccgct 
gagcccgctt 
tcttttggca 
ggatctgcgc 
cataaataaa 
ctgactgctt 
acttgtttat 
ataaagcatt 



gacatattat 

agctgatcga 

cccttcacga 

tttcgcagat 

cttttccgcc 

cggagcagag 

ttacctgcca 

ttgtgttaga 

ggaatacggg 

ttgtctacag 

aatttttttt 

aaaaggtcct 

tacccgccgt 

agaatgcaat 

acacccggtg 

ccaggctgtg 

ctgtaaacgc 

gtttgctgaa 

cgtgttaaat 

atctgacctc 

ggaacagagc 

aaagttagtc 

ctgtggtgag 

catcaagact 

ttttataaag 

tctggccatg 

ttccgtccgc 

gcggcggcgg 

aatgttgtac 

gatgggcagg 

gctaggaatc 

cagatcaagg 

cagctgacca 

gcaaaggtgg 

aattgttgct 

gcctttagat 

attatgaatg 

aaccttatcc 

accgatgtaa 

cccaaaagca 

tctgagggta 

gtgaaaagcg 

tctcagatgc 

agccactctc 

catttgggta 

atattgcttg 

atgaagatct 

tgtggcggta 

cccgatcact 

gattgaggta 

ggtcttatgt 

tttgatggaa 

cgtcagaatg 

accttgacct 

tcagccgctg 

gcaagcagtg 

caattggatt 

cagcaggttt 

aaaccagact 

aagtcgcaag 

tgcagcttat 

tttttcactg 



ctgccacgga 

agaggtactg 

actgtatgat 

ttttcccgac 

ggcgcccggt 

agccttgggt 

cgaggctggc 

ttatgtggag 

ggacccagat 

taagtgaaaa 

ttaattttta 

gtgtctgaac 

cctaaaatgg 

agtagtacgg 

gtcccgctgt 

gaatgtatcg 

cccaggccat 

tgagttgatg 

ggggcggggc 

atggaggctt 

tctaacagta 

tgcagaatta 

ctgtttgatt 

ttggattttt 

gataaatgga 

catctgtgga 

ccggcgataa 

caggagcaga 

aggtggctga 

ggctaaaggg 

tagcttttag 

ataattgcgc 

cttactggct 

cacttaggcc 

acatttctgg 

gtagcatgat 

taaggtttac 

tacacggtgt 

gggttcgggg 

gggcttcaat 

actccagggt 

tggctgtgat 

tgacctgctc 

gcaaggcctg 

acaggagggg 

agcccgagag 

ggaaggtgct 

aacatattag 

tggtgctggc 

ctgaaatgtg 

agttttgtat 

gcattgtgag 

tgatgggctc 

acgagaccgt 

cagccaccgc 

cagcttcccg 

ctttgacccg 

ctgccctgaa 

ctgtttggat 

ccgaattgga 

aatggttaca 

cattctagtt 



ggtgttatta 

gctgataatc 

ttagacgtga 

tctgtaatgt 

tctccggagc 

ccggtttcta 

tttccaccca 

caccccgggc 

attatgtgtt 

ttatgggcag 

cagttttgtg 

ctgagcctga 

cgcctgctat 

atagctgtga 

gccccattaa 

aggacttgct 

aaggtgtaaa 

taagtttaat 

ttaaagggta 

gggagtgttt 

cctcttggtt 

aggaggatta 

ctttgaatct 

ccacaccggg 

gcgaagaaac 

gagcggttgt 

taccgacgga 

gcccatggaa 

actgtatcca 

ggtaaagagg 

cttaatgacc 

taatgagctt 

gcagccaggg 

agattgcaag 

gaacggggcc 

aaatatgtgg 

tggccccaat 

aagcttctat 

ctgtgccttt 

taagaaatgc 

gcgccacaat 

taagcataac 

ggacggcaac 

gccagtgttt 

ggtgttccta 

catgtccaag 

gaggtacgat 

gaaccagcct 

ctgcacccgc 

tgggcgtggc 

ctgttttgca 

ctcatatttg 

cagcattgat 

gtctggaacg 

ccgcgggatt 

ttcatccgcc 

ggaacttaat 

ggcttcctcc 

ttggatcaag 

tccaattcgg 

aataaagcaa 

gtggtttgtc 



ccgaagaaat 
ttccacctcc 
cggcccccga 
tggcggtgca 
cgcctcacct 
tgccaaacct 
gtgacgacga 
acggttgcag 
cgctttgcta 
tgggtgatag 
gtttaaagaa 
gcccgagcca 
cctgagacgc 
ctccggtcct 
accagttgcc 
taacgagcct 
cctgtgattg 
aaagggtgag 
tataatgcgc 
ggaagatttt 
ttggaggttt 
caagtgggaa 
gggtcaccag 
gcgcgctgcg 
ccatctgagc 
gagacacaag 
ggagcagcag 
cccgagagcc 
gaactgagac 
gagcgggggg 
agacaccgtc 
gatctgctgg 
gatgattttg 
tacaagatca 
gaggtggaga 
ccgggggtgc 
tttagcggta 
gggtttaaca 
tactgctgct 
ctctttgaaa 
gtggcctccg 
atggtatgtg 
tgtcacctgc 
gagcataaca 
ccttaccaat 
gtgaacctga 
gagacccgca 
gtgatgctgg 
gctgagtttg 
ttaagggtgg 
gcagccgccg 
acaacgcgca 
ggtcgccccg 
ccgttggaga 
gtgactgact 
cgcgatgaca 
gtcgtttctc 
cctcccaatg 
caagtgtctt 
atcgatctta 
tagcatcaca 
caaactcatc 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 
aatgtatctt
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 

ccgctgttga 

tttactttca 

ggaataaggg 

agcatttatc 

aaacaaatag 

attattatca 

gcgtttcggt 

ttgtctgtaa 

cgggtgtcgg 

tatgcggtgt 

agcgttaata 

caataggccg 

agtgttgttc 

gggcgaaaaa 

tttttggggt 

agagcttgac 

gcgggcgcta 

gcgcttaatg 

gaagggcgat 

gcaaggcgat 

gccagtgaat 



atcatgtctg 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt
tcagttcggt 
ccgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 
tgacattaac 
gatgacggtg 
gcggatgccg 
ggctggctta 
gaaataccgc 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc
gggcgctggc 
cgccgctaca 
cggtgcgggc 
taagttgggt 
tgtaatacga 



gtcgactcta 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 

cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 
ctataaaaat 
aaaacctctg 
ggagcagaca 
actatgcggc 
acagatgcgt 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
aagtgtagcg 
gggcgcgtcc
ctcttcgcta 
aacgccaggg 
ctcactatag 



gactcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 

cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 
aggcgtatca 
acacatgcag 
agcccgtcag 
atcagagcag 
aaggagaaaa 
aatttttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg
gtcacgctgc 
cattcgccat 
ttacgccagc 
ttttcccagt 
ggcgaattaa 



cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 
cacctgacgt 
cgaggcccct 
ctcccggaga 
ggcgcgtcag 
attgtactga
taccgcatca 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttcgggg 



cactgactcg 4740 
ggtaatacgg 4800 
ccagcaaaag 4860 
cccccctgac 4920 
actataaaga 4980 
cctgccgctt 5040 
tagctcacgc 5100 
gcacgaaccc 5160 
caacccggta 5220 
agcgaggtat 5280 
tagaaggaca 5340 
tggtagctct 5400 
gcagcagatt 5460 
gtctgacgct 5520 
aaggatcttc 5580 
atatgagtaa 5640 
gatctgtcta 5700 
acgggagggc 5760 
ggctccagat 5820 
tgcaacttta 5880 
ttcgccagtt 5940 
ctcgtcgttt 6000 
atcccccatg 6060 
taagttggcc 6120 
catgccatcc 6180 
atagtgtatg 6240 
acatagcaga 6300 
aaggatctta 6360 
ttcagcatct 6420 
cgcaaaaaag 6480 
atattattga 6540 
ttagaaaaat 6600 
ctaagaaacc 6660 
ttcgtctcgc 6720 
cggtcacagc 6780 
cgggtgttgg 6840 
gagtgcacca 6900 
ggaaattgta 6960 
attttttaac 7020 
gatagggttg 7080 
caacgtcaaa 7140 
ctaatcaagt 7200 
cccccgattt 7260 
agcgaaagga 7320 
cacacccgcc 7380 
caactgttgg 7440 
gggatgtgct 7500 
taaaacgacg 7560 
7607 



<210> 35 
<211> 11600 
<212> DNA 

<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: plasmid MMTV-E2a-SV40-Neo



gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120
gagtgtccta
tctgcgcaca 
ggcatagctc 
ctcacccttg 
aactcgacct 
ttttgtcctt 
accaatccaa 
actattttta 
acctcaattg 
aaaaagagtg 
ccttacatct 
tgggataggt 
gaaagactcg 
atgaaacaac 
attttccata 
catggcatga 
agtaagtttt 

cttggtttgg 

ttggaattta 

attttttgag 

gccatcccgt 

cggcgaccct 

gtgacccttg 

ggctatcatc 

cgtccaagac 

gcgccctgcc 

ggtatcttgc 

acggggtaga 

ggtacgcgcg 

ggcacatcgc 

ttcaacagca 

acttgttcct 

gagcggtcct 

tcctcaagcg 

gcggtggtgg 

atctttttct 

acccccgagc 

ccgtccccgt 

gagtccgagg 

cggccatcga 

tctcccaagc 

gaagatgtgg 

aaaggaggta 

cggacgcaag 

ccgctgagtg 

atggacaagt 

gtggaagctc 

ctgaccttca 

ctgcagtcgt 

ctgcaccgct 

aataaggagc 

gagcagtcta 

tccaacaccg 

ggcaagtctt 

atcaaggctt 

atgccactac 

ctaccaaagt 

tccgacaaga 

aaccctgtgt 

tcggcgcccg 

accgagctgc 

aacgtgtccc 



cacctagggg 
aacggatgag 
tgctttgcct 
actcttttaa 
tcctcctgag 
cagaaataga 
taggtagatt 
ctcaaattca 
aagaacaggt 
tttttgtcaa 
acagaccaac 
gggttacagt 
ccagagctag 
aggtacatga 
ccaaggaggg 
gttattatga 
tggttacaaa 
tatcaaaggt 
tccaaatctt 
taaacttgca 
ctccgctcgt 
caggtcggcc 
tctctatttc 
acaagagcgg 
cctcaaagat 
gcaaggccag 
agttttggaa 
agttgaggcg 
gtgagaatag 
tgcgctcttg 
cgtcgtctcc 
cgtttgcctc 
cgtcgtcttc 
ggggtgcctc 
cgaactcaga 
gcctatagga 
gcggacgcgg 
cgccgccgcc 
acgaggaaga 
cctcggcggc 
ccgagcgccc 
cgctacaaat 
agcgcacagt 
aggaagagga 
tgccgatcgt 
accacgtgga 
tggcggccgt 
ccagcaacaa 
ttgcagaggt 
gcgctgagat 
acgtgattga 
gcaaggccaa 
acgcaaggtg 
gcggcatgtt 
ttatgcaggc 
ggtgcgagtg 
tgactccgtt 
gcgtgctggc 
atcgcaactc 
acctgctaaa 
cgcggatggt 
tgccagtggc 



agaagcagcc 
cccatcagac 
ggggctattg 
tagctcttct 
gcaaggacca 
aataagaatg 
attagttact 
gaagttagaa 
gcaaggacta 
aataggagac 
agatgccccc 
caatggctat 
acctccttgg 
ttatatttat 
gacagtggct 
atagccttta 
ctgttcttaa 
tctgatctga 
atgtaaatgc 
acagtcctaa 
cacttatcct 
gactgcggca 
tactatttgg 
aacggactca 
ttttggcact 
ctgcttgtcc 
aaagatgtga 
cgggttgggc 
gtggcgttcg 
caacgcgtcg 
cacatctagg 
tgcgttgtcc 
gcttacaaaa 
gacggggaag 
gggggcggtt 
gaaggaaatg 
tgcggcgcga 
tccccgggcg 
ctcatcacaa 
ggatttggcc 
gccatcacca 
ggtgggtttc
gcggcggctg 
agagcccagc 
gtctgcgtgg 
taacgatcta 
atgcaagacc 
gacctttgtg 
gacctacaag 
cgaaggcgag 
aatggatgtg 
gatcgtgaag 
ctgcgtgcac 
cttctctgaa 
gctgtatcct 
caactcaaag 
cgccctgagc 
cagcgtgcac 
gcgcgcgcag 
cgcgttggtg 
tgtgcctgag 
gcatagcgat 



aaggggttgt 
aaagacatat 
ggggaagttg 
gtgcaagatt 
cagccaactt 
cttgctaaaa 
atgttaagaa 
atgggaatag 
ttgaccacag 
aggtggtggc 
ttaccatata 
aaagtgttat 
tgtatgttgt 
ctaggaacag 
ggactaatag 
ttggcccaac 
aacgaggatg 
gctctgagtg 
ttatgtaaac 
cattcacctc 
tcactttcca 
gctggcgccc 
tgtttgtctt 
ccatagggac 
tcgttgagcg 
gctcggctgc 
taggtggcaa 
tcgcatgtgc 
taggcaaggc 
cagataatgg 
tagtcgccat 
tggtcttgct 
cctgggtcct 
gtggtaggcg 
aggctgtcct 
gccagtcggg 
cgtcccccaa 
cccccaaaaa 
gacgcgctgg 
attgcgccca 
gaggtaatcg 
agcaacccac 
aatgaagacg 
gaagcggaaa 
gagaagggca 
aaggcgaact 
tggctgaacg 
acgatgatgg 
catcacgagc 
cttaagtgtc 
acgagcgaaa 
aaccggtggg 
gacgcggcct 
ggcgcaaagg 
aacgcccaga 
cctgggcacg 
aacgcggagg 
cacccggcgc 
ggcggaggcc 
atggtgcgca 
tttaagtgga 
gcgcggcaga 



ttcccaccaa 
tcattctctg 
cggttcgtgc 
acaatctaaa 
cctcttacaa 
attatatttt 
atgaatcatt 
aaaatagaaa 
gcctagaagt 
aaccagggac 
caggaagata 
atagatccct 
ctcaagaaga 
gaatgcactt 
aacattattc 
cttgcggttc 
tgagacaagt 
ttctattttc 
caagatataa 
ttgtgtgttt 
gagggtcccc 
gaacagggac 
gtattgtctc 
caagctagcg 
aggcgatatc 
ggttggcacg 
gcacctctgg 
cgttttcttg 
tgacatccgc 
cgcactggcg 
gcctttcgtc 
ttttatcctc 
gctcgataat 
cgttggcggc 
tcttctcgac 
aagaggagca 
ccatggagga 
agcggatgag 
tgccgcgcac 
agaagaaaaa 
tggacagcga 
cggtgctaat 
acccagtggc 
gtgaaattac 
tggaggctgc 
tcaaactact 
aggagcaccg 
ggcgattcct 
ccacgggctg 
tacacggaag 
acgggcagcg 
gccgaaatgt 
gtccggccaa 
ctcaggtggc 
ccgggcacgg 
cgcccttttt 
acctggacgc 
tgatagtgtt 
ccaactgcga 
gcctgtggag 
gcactaaaca 
acccctttga 



ggacgacccg 180 
ctgcaaactt 240 
tcgcagggct 300 
caattcggag 360 
gccgcatcga 420 
taccaataag 480 
atcttttagt 540 
gagacgctca 600 
aaaaaaggga 660 
ttatagggga 720 
tgacttaaat 780 
cccttttcgt 840 
aaaagacgac 900 
ttggggaaag 960 
tgcaaaaact 1020 
ccagggctta 1080 
ggtttcctga 1140 
ctatgttctt 1200 
aagagtgctg 1260 
gtgtctgttc 1320 
ccgcagaccc 1380 
cctcggataa 1440 
tttcttgtct 1500 
cttctcgtcg 1560 
aggtatgaca 1620 
gcaggatagg 1680 
cacggcaaat 1740 
gcgtttgggg 1800 
tatggcgagg 1860 
ctgcagatgc 1920 
cccccgcccg 1980 
tgttggtact 2040 
cacttcctcc 2100 
atcggtggag 2160 
tgactccatg 2220 
gcgcgaaacc 2280 
cgtgtcgtcc 2340 
gcggcgtatc 2400 
acccagcccg 2460 
gaagcgccct 2520 
ggaagaaaga 2580 
caagcatggc 2640 
gcgtggtatg 2700 
ggtgatgaac 2760 
gcgcgcgctg 2820 
gcctgaccaa 2880 
cgggttgcag 2940 
gcaggcgtac 3000 
cgcgttgtgg 3060 
cattatgata 3120 
cgcgctgaag 3180 
ggtgcagatc 3240 
tcagttttcc 3300 
ttttaagcag 3360 
tcaccttttg 3420 
gggaaggcag 3480 
ggatctgatc 3540 
ccagtgctgc 3600 
cttcaagata 3660 
tgaaaacttc 3720 
ccagtatcgc 3780 
tttttaaacg 3840 
gcgcagacgg
cctttattga 
gtggcgctcc 
ctgtagagct 
gagttgggta 
cagttggcaa 
ccctgtacgc 
gcgctacggc 
gaaaagccaa 
atccagtgta 
tgtggagaaa 
ccaagatctt 
gttgatgttg 
acacggtttc 
acaaaacata 
ttacctcagg 
aattggacaa 
tataatgtgt 
ctgatgaatg 
aaatgccatc 
agaagagaaa 
atgctgtgtt 
ctgcactgct 
acagttataa 
ttaataacta 
aggaatattt 
gaggttttac 
aatgcaattg 
agcatcacaa 
aaactcatca 
tgtggaaagt 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
ggaagcggaa 
gctactgggc 
gtgggcttac
tgccagctgg 
tcttgccgcc 
gatcgtttcg 
agaggctatt 
tccggctgtc 
tgaatgaact 
gcgcagctgt 
tgccggggca 
ctgatgcaat 
cgaaacatcg 
atctggacga 
gcatgcccga 
tggtggaaaa 
gctatcagga 
ctgaccgctt 
atcgccttct 
gacgcccaac 
cttcggaatc 
ggagttcttc 
gctggacgac 
cagcggctat 
tttggtcccg 
tacctacaga 



caagggtggg 
aagtgtctct 
taatctgcgc 
gttcctggtt 
ccccggtaat 
aggcgtggag 
tttgggtgga 
gcaggagtga 
agggctcaaa 
cgcccccagt 
caaagcctgg 
tgacaatggc 
cttgcttctt 
gatgacgccg 
aagaagggtg 
tcgacctcga 
actacctaca 
taaactactg 
ggagcagtgg 
tagtgatgat 
ggtagaagac 
tagtaataga 
atacaagaaa 
tcataacata 
tgctcaaaaa 
gatgtatagt 
ttgctttaaa 
ttgttgttaa 
atttcacaaa 
atgtatctta 
ccccaggctc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctcggcctc 
gcaaaaagct 
cacgtagaaa 
tatctggaca 
atggcgatag 
ggcgccctct 
aaggatctga 
catgattgaa 
cggctatgac 
agcgcagggg 
gcaggacgag 
gctcgacgtt 
ggatctcctg 
gcggcggctg 
catcgagcga 
agagcatcag 
cggcgaggat 
tggccgcttt 
catagcgttg 
cctcgtgctt 
tgacgagttc 
ctgccatcac 
gttttccggg 
gcccaccccg 
ctcgcggagt 
ccgcgcatcc 
gatctttgtg 
gatttaaagc 



ggtaaataat 
agtacattat 
actgtggctg 
gcgacgcagg 
aaggttcatg 
aaacatgcag 
cttttccagc 
ctcgtactca 
gaggtagcat 
ctcgcgaccg 
aaagcgcttg 
tttcagttcc 
tatgttgtgg 
cggtgcggcc 
ggctcgtcca 
gggatctttg 
gagatttaaa 
attctaattg 
tggaatgcct 
gaggctactg 
cccaaggact 
actcttgctt 
attatggaaa 
ctgttttttc 
ttgtgtacct 
gccttgacta 
aaacctccca 
cttgtttatt 
taaagcattt 
tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
tcacgctgcc 
gccagtccgc 
agggaaaacg 
ctagactggg 
ggtaaggttg 
tggcgcaggg 
caagatggat 
tgggcacaac 
cgcccggttc 
gcagcgcggc 
gtcactgaag 
tcatctcacc 
catacgcttg 
gcacgtactc 
gggctcgcgc 
ctcgtcgtga 
tctggattca 
gctacccgtg 
tacggtatcg 
ttctgagcgg 
gagatttcga 
acgccggctg 
ggctcgatcc 
tctaccggca 
atgcccccga 
aaggaacctt 
tctaaggtaa 



cacccgagag 
ttttacatgt 
cggaagtagg 
gtgggctgta 
gtggggttgt 
cagaatagtc 
gttatacagc 
aactggtaaa 
gtttttgagt 
gccgtattga 
tcataggtgc 
tgctcactgg 
cgttgccggc 
ggtgcacacg 
tgggatccat 
tgaaggaacc 
gctctaaggt 
tttgtgtatt 
ttaatgagga 
ctgactctca 
ttccttcaga 
gctttgctat 
aatattctgt 
ttactccaca 
ttagcttttt 
gagatcataa 
cacctccccc 
gcagcttata 
ttttcactgc 
atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 
agaaacggtg 
caagcgcaaa 
cggttttatg 
ggaagccctg 
gatcaagatc 
tgcacgcagg 
agacaatcgg 
tttttgtcaa 
tatcgtggct 
cgggaaggga 
ttgctcctgc 
atccggctac 
ggatggaagc 
cagccgaact 
cccatggcga 
tcgactgtgg 
atattgctga 
ccgctcccga 
gactctgggg 
ttccaccgcc 
gatgatcctc 
cctcgcgagt 
gtgcaaatcc 
actgcaggag 
acttctgtgg 
atataaaatt 



tgtacaaata 
ttttcaagtg 
gcgagtggcg 
cctggggact 
gatccatggg 
cacaggcggc 
ggtcggggga 
cctgcttgag 
gcgggttcca 
ctatggcgca 
ccaaaaaata 
agcccatggc 
cgagaagggc 
gaccacgtca 
atatagggcc 
ttacttctgt 
aaatataaaa 
ttagattcca 
aaacctgttt 
acattctact 
attgctaagt 
ttacaccaca 
aacctttata 
caggcataga 
aatttgtaaa 
tcagccatac 
tgaacctgaa 
atggttacaa 
attctagttg 
ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 

gaggaggctt 
agggcgcaag 
ctgaccccgg 
gagaaagcag 
gacagcaagc 
caaagtaaac 
tgatcaagag 
ttctccggcc 
ctgctctgat 
gaccgacctg 
ggccacgacg 
ctggctgcta 
cgagaaagta 
ctgcccattc 
cggtcttgtc 
gttcgccagg 
tgcctgcttg 
ccggctgggt 
agagcttggc 
ttcgcagcgc 
ttcgaaatga 
gccttctatg 
cagcgcgggg 
tggttcagct 
gtcggcatcc 
tggggaggca 
tgtgacataa 
tttaagtgta 



aaagcatttg 
acaaaaagaa 
ctccaggaag 
gttgagcatg 
agtttggggc 
cgagttgggc 
agaagcaatg 
tcgctggtca 
ggcaaaggcc 
ggcgagcttg 
tggcccacaa 
ggcagctgtt 
gtgcgcaggt 
aagacttcaa 
cgggttataa 
ggtgtgacat

tttttaagtg 
acctatggaa 
tgctcagaag 
cctccaaaaa 
tttttgagtc 
aaggaaaaag 
agtaggcata 
gtgtctgcta 
ggggttaata 
cacatttgta 
acataaaatg 
ataaagcaat 
tggtttgtcc 
tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 
atgaatgtca 
gtagcttgca 
gaaccggaat 
tggatggctt 
acaggatgag 
gcttgggtgg 
gccgccgtgt 
tccggtgccc 
ggcgttcctt 
ttgggcgaag 
tccatcatgg 
gaccaccaag 
gatcaggatg 
ctcaaggcgc 
ccgaatatca 
gtggcggacc 
ggcgaatggg 
atcgccttct 
ccgaccaagc 
aaaggttggg 
atctcatgct 
gctgcctgag 
aggaaaccag 
cgatggccgc 
ttggacaaac 
taatgtgtta 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
aactactgat
agcagtggtg 
gtgatgatga 
tagaagaccc 
gtaatagaac 
acaagaaaat 
ataacatact 
ctcaaaaatt 
tgtatagtgc 
gctttaaaaa 
gttgttaact 
ttcacaaata 
gtatcttatc 
ctctacttga 
aattaggtca 
ggtaatttta 
cagcccacaa 
accctgctca 
attttcccca 
aacccagcac 
catctgctga 
cctgtagttt 
aaaatttgac 
atgcaagttt 
catcaaaata 
cgaaagggcc 
tagacgtcag 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
agggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
accgcctttg 
gtgagcgagg 
atttcacacc 
cagtatctgc 
gctacaacaa 



tctaattgtt 
gaatgccttt 
ggctactgct 
caaggacttt 
tcttgcttgc 
tatggaaaaa 
gttttttctt 
gtgtaccttt 
cttgactaga 
acctcccaca 
tgtttattgc 
aagcattttt 
atgtctggat 
gaggacattc 
cttaacaaaa 
aaatatctgg 
atgtcaacag 
tcaagaagca 
cctgtgtagg 
tccactggat 
ctgtcaactg 
gctaacacac 
ccttgaatgg 
aacatagcag 
tttccacagg 
tcgtgatacg 
gtggcacttt 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 
agtgagctga 
aagcggaaga 
gcatatggtg 
tccctgcttg 
ggcaaggctt 



tgtgtatttt 
aatgaggaaa 
gactctcaac 
ccttcagaat 
tttgctattt 
tattctgtaa 
actccacaca 
agctttttaa 
gatcataatc 
cctccccctg 
agcttataat 
ttcactgcat 
ccccaggaag 
caatcatagg 
aggaaattgg 
gaagtccctt 
cagaaacata 
ctgtggttgc 
ttccaaaata 
aagcattatc 
tagcattttt 
cctgcagctc 
gttttccagc 
ttaccccaat 
ttaagtcctc 
cctattttta 
tcggggaaat 
tccgctcatg 
gagtattcaa 
ttttgctcac 
agtgggttac 
agaacgtttt 
tgttgacgcc 
tgagtactca 
cagtgctgcc 
aggaccgaag 
tcgttgggaa 
tgcagcaatg 
ccggcaacaa 
ggcccttccg 
cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 
accgctacca 
aactggcttc 
ccaccacttc 
agtggctgct 
accggataag 
gcgaacgacc 
tcccgaaggg 
cacgagggag 
cctctgactt 
cgccagcaac 
ctttcctgcg 
taccgctcgc 
gcgcctgatg 
cactctcagt 
tgtgttggag 
gaccgacaat 



agattccaac 
acctgttttg 
attctactcc 
tgctaagttt 
acaccacaaa 
cctttataag 
ggcatagagt 
tttgtaaagg 
agccatacca 
aacctgaaac 
ggttacaaat 
tctagttgtg 
ctcctctgtg 
ctgcccatcc 
gtaggggttt 
ccactgctgt 
caagctgtca 
tgtgttagta 
tctagtgttt 
cttatccaaa 
tggggttaca 
caaaggttcc 
accattttca 
aacctcagtt 
atttaaatta 
taggttaatg 
gtgcgcggaa 
agacaataac 
catttccgtg 
ccagaaacgc 
atcgaactgg 
ccaatgatga 
gggcaagagc 
ccagtcacag 
ataaccatga 
gagctaaccg 
ccggagctga 
gcaacaacgt 
ttaatagact 
gctggctggt 
gcagcactgg 
caggcaacta 
cattggtaac 
ttttaattta 
taacgtgagt 
tgagatcctt 
gcggtggttt 
agcagagcgc 
aagaactctg 
gccagtggcg 
gcgcagcggt 
tacaccgaac 
agaaaggcgg 
cttccagggg 

gagcgtcgat 
gcggcctttt
ttatcccctg 
cgcagccgaa 
cggtattttc 
acaatctgct 
gtcgctgagt 
tgcatgaaga 



ctatggaact 
ctcagaagaa 
tccaaaaaag 
tttgagtcat 
ggaaaaagct 
taggcataac 
gtctgctatt 
ggttaataag 
catttgtaga 
ataaaatgaa 
aaagcaatag 
gtttgtccaa 
tcctcataaa 
accctctgtg 
ttcacagacc 
gttccagaag 
gctttgcaca 
atgtgcaaaa 
tcatttttac 
acagccttgt 
gtttgagcag 
ccaccaacag 
tgagtttttt 
ttaacagtaa 
ggcaaaggaa 
tcatgataat 
cccctatttg 
cctgataaat 
tcgcccttat 
tggtgaaagt 
atctcaacag 
gcacttttaa 
aactcggtcg 
aaaagcatct 
gtgataacac 
cttttttgca 
atgaagccat 
tgcgcaaact 
ggatggaggc 
ttattgctga 
ggccagatgg 
tggatgaacg 
tgtcagacca 
aaaggatcta 
tttcgttcca 
tttttctgcg 
gtttgccgga 
agataccaaa 
tagcaccgcc 
ataagtcgtg 
cgggctgaac 
tgagatacct 
acaggtatcc 
gaaacgcctg 
ttttgtgatg 
tacggttcct 
attctgtgga 
cgaccgagcg 
tccttacgca 
ctgatgccgc 
agtgcgcgag 
atctgcttag 



gatgaatggg 
atgccatcta 
aagagaaagg 
gctgtgttta 
gcactgctat 
agttataatc 
aataactatg 
gaatatttga 
ggttttactt 
tgcaattgtt 
catcacaaat 
actcatcaat 
ccctaacctc 
tcctcctgtt 
gctttctaag 
tgttggtaaa 
agggcccaac 
caggaggcac 
ttggatcagg 
ggtcagtgtt 
gatatttggt 
caaaaaaatg 
gtgtccctga
cagcttccca 
ttcttgaaga 
aatggtttct 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 
ggggggttcg 
acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagtca 
tctgtgcggt 
atagttaagc 
caaaatttaa 
ggttaggcgt 



7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 11520
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580
catggattgg acgaaccact 11600



<210> 36 
<211> 53 
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer
<400> 36 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 



<210> 37 
<211> 36 
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer

<400> 37 

cgagatcgat cacctccggt acaaggtttg gcatag 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: primer
<400> 38 

catgaagatc tggaaggtgc tgaggtacga tgagacc 

<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer

<400> 39

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 



<210> 40 
<211> 38 
<212> DNA 

<213> Artificial Sequence 



<220>
<223> Description of Artificial Sequence: primer

<400> 40

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 38
<210> 41
<211> 32
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer

<400> 41

caccccgggg aggcggcggc gacggggacg gg 32

<210> 42 
<211> 7231 
<212> DNA 

<213> Artificial Sequence



<223> Description of Artificial Sequence: plasmid pDV80



<400> 42 

ctgctccctg 

acaaggcaag 

ctgcttcgcg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 

ccgagctcgg 

tactccctct 

gaggatttga 

tcagaaaaga 

gacagcaact 

gccgcgatgt 

gtgcgctcgt 

acgctggtgg 

cgcgagcaga 

gtaaagaccc 

agcgcctgct 

catggcatgg 

ggctctctga 

acgtaatcgt 

ggctgctctg 

ggacgctgga 

gcgtaggagt 

tagtccaggg 

cggttgagga 

tccgaacgag 

ggaaaacctc 

gcccgcccca 

aatatcccct 

ggggtactgt 

aaggtgggag 

ctgcaagtta 

gctaataaac 



cttgtgtgtt 

gcttgaccga 

atgtacgggc 

ttacggggtc 

atggcccgcc 

ttcccatagt 

aaactgccca 

tcaatgacgg 

ctacttggca 

agtacatcaa 

ttgacgtcaa 

acaactccgc 

gcagagctct 

ctcactatag 

atccactctc 

gaaaagcggg 

tattcacctg 

caatcttttt 

tggcgatgga 

ttagctgcac 

cgggcaccag 

ctacctctcc 

atggcggtag 

cgggcagcag 

gccatgcgcg 

ggtgggtgag 

gtattccaag 

atagttcgtg 

ctcggaagac 

agacgttgaa 

cgcgcagctt 

tttccttgat 

caaactcttc 

atccgtactc 

tcgagaaagg

gcgaagatga 

tcctcactcc 

cactcaaact 

gtggtctcac 

atactgataa 

ttagtttaaa 



ggaggtcgct 

caattgcatg 

cagatatacg 

attagttcat 

tggctgaccg 

aacgccaata 

cttggcagta 

taaatggccc 

gtacatctac 

tgggcgtgga 

tgggagtttg 

cccattgacg 

ctggctaact 

ggagacccaa 

ttccgcatcg 

catgacttct 

gcccgcggtg 

gttgtcaagc 

gcgcagggtt 

gtattcgcgc 

gtgcacgcgc 

gcgtaggcgc 

ggggtctagc 

gcgcgcgtcg 

ggcggcaagc 

cgcggaggcg 

atatgtaggg 

cgagggagcg 

tatctgcctg 

gctggcgtct 

gttgaccagc 

gatgtcatac 

gcggtctttc 

cgccgccgag 

cgtctaacca 

cttcaacccc 

cccctttgtc 

ggctgatcca 

tttgcaagat 

aaaacttgag 

agtaggacat 



gagtagtgcg 

aagaatctgc 

cgttgacatt 

agcccatata 

cccaacgacc 

gggactttcc 

catcaagtgt 

gcctggcatt 

gtattagtca

tagcggtttg 

ttttggcacc 

caaatgggcg 

agagaaccca 

gctggctagc 

ctgtctgcga 

gcgctaagat 

atgcctttga 

ttggtggcaa 

tggtttttgt 

gcaacgcacc 

caaccgcggt 

tcgttggtcc 

tgcgtctcgt 

aagtagtcta 

gcgcgctcgt 

tacatgccgc 

tagcatcttc 

aggaggtcgg 

aagatggcat 

gtgagaccta 

tcggcggtga 

ttatcctgtc 

cagtactctt 

ggacctgagc 

gtcacagtcg 

gtctacccct 

tcctccgatg 

atcaccatta 

ggaagcctaa 

cttgcatatg 

ggattaaaag 



cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gtttaaactt 

gggccagctg 

tgtcagtttc 

gggtggccgc 

acgacccgta 

cgcgatcggc 

gccattcggg 

tgtgcagggt 

agcagaggcg 

ccggggggtc 

tcttgcatcc 

atgggttgag 

aaatgtcgta 

caccgcggat 

gaccgaggtt 

gtgagttgga 

ccgcgtcacg 

cctgcacgtc 

cctttttttt 

ggatcggaaa 

gagtccgcat 

caagatccaa 

atggctacgc 

gattcaaaaa 

ccaatgggga 

ctgtaaaccc 

ataatccatt 

tattagatga 



ttaagctaca 

gcgttttgcg 

tagttattaa 

cgttacataa 

gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 

aagcttggta 

ttggggtgag 

caaaaacgag 

atccatctgg 

gagggcgttg 

gcgctccttg 

aaagacggtg 

gacaaggtca 

gccgcccttg 

tgcgtccacg 

ttgcaagtct 

tgggggaccc 

aacgtagagg 

gctggcgcgc 

gctacgggcg 

tgatatggtt 

cacgaaggag 

tagggcgcag 

ccacagctcg 

cccgtcggcc 

cgaccggatc 

gatgaagcgc 

gcggaatcag 

cttcccccct 

tgtatccctc 

taaggctcca 

tgaaagtagt 

aaaaagtgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
gcggggttaa
gaaaatttag 
gaagggttga 
cgcacacttt 
tctaaactca 
attgtggtcg 
tttactatta 
aaagcttatt 
attggtttta 
agagacatag 
aaaactacct 
tccaaaacct 
caagaatgaa 
cgactgtgcc 
ccctggaagg 
gtctgagtag 
attgggaaga 
aaagaaccag 
cggcgggtgt 
ctcctttcgc 
taaatcgggg 
aacttgatta 
ctttgacgtt 
tcaaccctat 
ggttaaaaaa 
tcagttaggg 
atctcaatta 
tgcaaagcat 
cgcccctaac 
tttatgcaga 
tttttggagg 
ctgatcagca 
caaggtgagg 
cgacgtcgcc 
ggaggacgac 
ggaccaggtg 
gtacgccgag 
gaccgagatc 
ctgcgtgcac 
cgccgccttc 
cctccagcgc 
ttataatggt 
actgcattct 
gtcgacctct 
ttatccgctc 
tgcctaatga 
gggaaacctg 
gcgtattggg 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 



aagatttaat 
aaaatacaga 
catttgacaa 
ggacaacacc 
ctttggtact 
caggaaagta 
aactgctatt 
ggaactttag 
tgcctaattt 
tttatggaac 
ttaaccaaga 
atgaaaatgt 
agagcggccg 
ttctagttgc 
tgccactccc 
gtgtcattct 
caatagcagg 
ctggggctct 
ggtggttacg 
tttcttccct 

catcccttta 
gggtgatggt 
ggagtccacg 
ctcggtctat 
tgagctgatt 
tgtggaaagt 
gtcagcaacc 
gcatctcaat 
tccgcccagt 
ggccgaggcc 
cctaggcttt 
cgtgttgaca 
aactaaacca 
ggagcggtcg 
ttcgccggtg 
gtgccggaca 
tggtcggagg 
ggcgagcagc 
ttcgtggccg 
tatgaaaggt 
ggggatctca 
tacaaataaa 
agttgtggtt 
agctagagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 



tggcaaactt 
tggtagcagc 
tgatggatac 
agacacatct 
tacaaagtgt 
ccacatcata 
taataagaac 
aagtggaaat 
ggtagcgtat 
tatatatctt 
aactggatgt 
tgaatttgaa 
ctcgagtcta 
cagccatctg 
actgtccttt 
attctggggg 
catgctgggg 
agggggtatc 
cgcagcgtga 
tcctttctcg 
gggttccgat 
tcacgtagtg 
ttctttaata 
tcttttgatt 
taacaaaaat 
ccccaggctc 
aggtgtggaa 
tagtcagcaa 
tccgcccatt 
gcctctgcct 
tgcaaaaagc 
attaatcatc 
tggccaagtt 
agttctggac 
tggtccggga 
acaccctggc 
tcgtgtccac 
cgtgggggcg 
aggagcagga 
tgggcttcgg 
tgctggagtt 
gcaatagcat 
tgtccaaact 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 



gtggttttaa 
agaggaattg 
ttggtagcat 
ccaaactgca 
ggaagtcaaa 
aataataaga 
ggagtgcttt 
tccaatgttt 
ccaaaaccca 
ggtggaaaac 
gaatactcta 
accacctctt 
gagggcccgt 
ttgtttgccc 
cctaataaaa 
gtggggtggg 
atgcggtggg 
cccacgcgcc 
ccgctacact 
ccacgttcgc 
ttagtgcttt 
ggccatcgcc 
gtggactctt 
tataagggat 
ttaacgcgaa 
cccaggcagg 
agtccccagg 
ccatagtccc 
ctccgcccca 
ctgagctatt 
tcccgggagc 
ggcatagtat 
gaccagtgcc 
cgaccggctc 
cgacgtgacc 
ctgggtgtgg 
gaacttccgg 
ggagttcgcc 
ctgacacgtg 
aatcgttttc 
cttcgcccac 
cacaaatttc 
catcaatgta 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
atgctcacgc 
gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 



caggaaaagg 
gtataaatgt 
ggaacccaaa 
caattgctca 
tattagctaa 
caaatccaaa 
tagacaactc 
cgacagctta 
gtaattctaa 
ctgatcagcc 
tcacatttaa 
ttaccttctc 
ttaaacccgc 
ctcccccgtg 
tgaggaaatt 
gcaggacagc 
ctctatggct 
ctgtagcggc 
tgccagcgcc 
cggctttccc 
acggcacctc 
ctgatagacg 
gttccaaact 
tttggggatt 
ttaattctgt 
cagaagtatg 
ctccccagca 
gcccctaact 
tggctgacta 
ccagaagtag 
ttgtatatcc 
atcggcatag 
gttccggtgc 
gggttctccc 
ctgttcatca 
gtgcgcggcc 
gacgcctccg 
ctgcgcgacc 
ctacgagatt 
cgggacgccg 
cccaacttgt 
acaaataaag 
tcttatcatg 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 



aataggcact 2520 
aagagcaaga 2580 
gtatgacacg 2640 
agataaggac 2700 
tgtgtctttg 2760 
aataaaaagt 2820 
aaatcttgga 2880 
tgaaaaagca 2940
aaaatatgca 3000 
agcagtcatt 3060 
ctttagttgg 3120 
ctatattgcc 3180 
tgatcagcct 3240 
ccttccttga 3300 
gcatcgcatt 3360
aagggggagg 3420 
tctgaggcgg 3480 
gcattaagcg 3540 
ctagcgcccg 3600 
cgtcaagctc 3660 
gaccccaaaa 3720 
gtttttcgcc 3780 
ggaacaacac 3840 
tcggcctatt 3900 
ggaatgtgtg 3960 
caaagcatgc 4020 
ggcagaagta 4080 
ccgcccatcc 4140 
atttttttta 4200 
tgaggaggct 4260 
attttcggat 4320 
tataatacga 4380 
tcaccgcgcg 4440 
gggacttcgt 4500 
gcgcggtcca 4560 
tggacgagct 4620 
ggccggccat 4680 
cggccggcaa 4740 
tcgattccac 4800
gctggatgat 4860 
ttattgcagc 4920 
catttttttc 4980 
tctgtatacc 5040 
tgtgaaattg 5100 
aagcctgggg 5160 
ctttccagtc 5220 
gaggcggttt 5280 
tcgttcggct 5340 
aatcagggga 5400 
gtaaaaaggc 5460 
aaaatcgacg 5520 
ttccccctgg 5580 
tgtccgcctt 5640 
tcagttcggt 5700 
ccgaccgctg 5760 
tatcgccact 5820 
ctacagagtt 5880 
tctgcgctct 5940 
aacaaaccac 6000 
aaaaaggatc 6060 
aaaactcacg 6120 
ttttaaatta 6180 
aaaatgaagt

atgcttaatc 

ctgactcccc 

tgcaatgata 

agccggaagg 

taattgttgc 

tgccattgct 

cggttcccaa 

ctccttcggt 

tatggcagca 

tggtgagtac 

cccggcgtca 

tggaaaacgt 

gatgtaaccc 

tgggtgagca 

atgttgaata 

tctcatgagc 

cacatttccc 



tttaaatcaa 

agtgaggcac 

gtcgtgtaga 

ccgcgagacc 

gccgagcgca 

cgggaagcta 

acaggcatcg 

cgatcaaggc 

cctccgatcg 

ctgcataatt 

tcaaccaagt 

atacgggata 

tcttcggggc 

actcgtgcac 

aaaacaggaa 

ctcatactct 

ggatacatat 

cgaaaagtgc 



tctaaagtat 

ctatctcagc 

taactacgat 

cacgctcacc 

gaagtggtcc 

gagtaagtag 

tggtgtcacg 

gagttacatg 

ttgtcagaag 

ctcttactgt 

cattctgaga 

ataccgcgcc 

gaaaactctc 

ccaactgatc 

ggcaaaatgc 

tcctttttca 

ttgaatgtat 

cacctgacgt 



atatgagtaa 

gatctgtcta 

acgggagggc 

ggctccagat 

tgcaacttta 

ttcgccagtt 

ctcgtcgttt 

atcccccatg 

taagttggcc 

catgccatcc 

atagtgtatg 

acatagcaga 

aaggatctta 

ttcagcatct 

cgcaaaaaag 

atattattga 

ttagaaaaat 



acttggtctg 

tttcgttcat 

ttaccatctg 

ttatcagcaa 

tccgcctcca 

aatagtttgc 

ggtatggctt 

ttgtgcaaaa 

gcagtgttat 

gtaagatgct 

cggcgaccga 

actttaaaag 

ccgctgttga 

tttactttca 

ggaataaggg 

agcatttatc 

aaacaaatag 



acagttacca 6240 
ccatagttgc 6300 
gccccagtgc 6360 
taaaccagcc 6420 
tccagtctat 6480 
gcaacgttgt 6540 
cattcagctc 6600 
aagcggttag 6660 
cactcatggt 6720 
tttctgtgac 6780 
gttgctcttg 6840 
tgctcatcat 6900 
gatccagttc 6960 
ccagcgtttc 7020 
cgacacggaa 7080 
agggttattg 7140 
gggttccgcg 7200 
7231 



<210> 43 
<211> 48
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer
‘tgJo’ttgU ccaagatgaa gcgcgcccgc cco.gcga.g atgaettc 



<210> 44 
<211> 28 
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer
aaacacggcg gccgctcttt cattcttg 



<210> 45 
<211> 17
<212> PRT
<213> Ad37 N-terminus

<400> 45

Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr
1               5                   10                  15



<210> 46 
<211> 6
<212> PRT
<213> artificial sequence



<400> 46 

Lys Arg Ala Arg Pro Ser 
1 5 
<212> PRT
<213> Ad5 modified N-terminus

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro Tyr
1               5                   10                  15



<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 48 

ggatccatgg gatacttggt agca 

<210> 49 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220>
<223> Description of Artificial Sequence: primer

<400> 49

gcaactcgag tcattcttgg gcaatatagg 30

<210> 50
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: primer

<400> 50

cgcgctgact cttaaggact agtttc 26

<210> 51
<211> 37
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: primer

<400> 51

gcgcttaatt aacatcatca ataatatacc ttatttt 37




